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1. INTRODUCTION 

Instance optimality: our model(s). Standard worst-case analysis of algorithms has often been criticized as 
overly pessimistic. As a remedy, some researchers have turned towards adaptive analysis where the execution 
cost of algorithms is measured as a function of not just the input size but also other parameters that capture 
in some ways the difficulty of the input instance. For example, for problems in computational geometry (the 
primary domain of the present paper), parameters that have been considered in the past include the output 
size (leading to so-called output-sensitive algorithms) [Kirkpatrick and Seidel 1986], the spread of an input 
point set (the ratio of the maximum to the minimum pairwise distance) [Erickson 2005], various measures 
of fatness of the input objects (e.g., ratio of circumradii to inradii) [Matousek et al. 1994] or clutteredness 
of a collection of objects [de Berg et al. 2002], the number of reflex angles in an input polygon, and so on. 

The ultimate in adaptive algorithms is an instance-optimal algorithm, i.e., an algorithm A whose cost is 
at most a constant factor from the cost of any other algorithm A' running on the same input, for every 
input instance. Unfortunately, for many problems, this requirement is too stringent. For example, consider 
the 2-d convex hull problem, which has 0(?tlogn) worst-case complexity in the algebraic computation tree 
model: for every input sequence of n points, one can easily design an algorithm A' (with its code depending 
on the input sequence) that runs in 0(n) time on that particular sequence, thus ruling out the existence of 
an instance-optimal algorithm. 1 

To get a more useful definition, we suggest a variant of instance optimality where we ignore the order in 
which the input elements are given, as formalized precisely below: 


Definition 1.1. Consider a problem where the input consists of a sequence of n elements from a domain 
D. Consider a class A of algorithms. A correct algorithm refers to an algorithm that outputs a correct answer 
for every possible sequence of elements in T>. 

For a set S of n elements in 22, let Ta(S) denote the maximum running time of A on input a over all 
n\ possible permutations a of S. Let OPT(S') denote the minimum of Ta'(S) over all correct algorithms 
A' G A. If A € A is a correct algorithm such that Ta(S) < 0(1) ■ OPT(S) for every set S, then we say A is 
instance-optimal in the order-oblivious setting. 

For many problems, the output is a function of the input as a set rather than a sequence, and the above 
definition is especially meaningful. In particular, for such problems, instance-optimal algorithms are auto¬ 
matically optimal output-sensitive algorithms; in fact, they are automatically optimal adaptive algorithms 
with respect to any parameter that is independent of the input order, all at the same time! This property 
is satisfied by simple parameters like the spread of an input point set S 7 or more complicated quantities like 
the expected size f r (S) of the convex hull of a random sample of size r from S [Clarkson 1994]. 

For many algorithms (e.g., quickhull [Preparata and Shanros 1985], to name one), the running time is 
not affected so much by the order in which the input points are given but by the relative positions of the 
input points. Combinatorial and computational geometers more often associate “bad examples” with bad 
point sets rather than bad point sequences. All this supports the reasonableness and importance of the 
order-oblivious form of instance optimality. 

We can consider a still stronger variant of instance optimality: 

Definition 1.2. For a set S of n elements in 22, let T^ e (S) denote the average running time of A on input 
a over all n! possible permutations er of S. Let OPT avs (5') denote the minimum of T^( s (S) over all correct 


1 The length of the program for A' may depend on n in this example. If we relax the definition to permit the “constant factor” 
to grow as a function of the program length of A 1 , then an instance-optimal algorithm A exists for many problems such as 
sorting (or more generally problems that admit linear-time verification). This follows from a trick attributed to Levin [Jones 
1997], of enumerating and simulating all programs in parallel under an appropriate schedule. To say that algorithms obtained 
this way are impractical, however, would be an understatement. 
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Fig. 1. (a) A “harder” point set and (b) an “easier” point set for the 2-d upper hull problem, (c) A “harder” point set and (d) 

an “easier” point set for the 2-d maxima problem. 


algorithms A' € A. If A € A is a correct algorithm such that Ta(S ) < 0(1) • 0PT avg (5') for every set 5, 
then we say A is instance-optimal in the random-order setting. 2 

Note that an instance-optimal algorithm in the above sense is immediately also competitive against ran¬ 
domized (Las Vegas) algorithms A ', by the easy direction of Yao’s principle. The above definition has extra 
appeal in computational geometry, as it is common to see the design of randomized algorithms where the 
input elements are initially permuted in random order [Clarkson and Shor 1989]. 

Instance-optimality in the random-order setting also implies average-case optimality where we analyze 
the expected running time under the assumption that the input elements are random and independently 
chosen from a common given probability distribution. (To see this, just observe that the input sequence is 
equally likely to be any permutation of S conditioned to the event that the set of n input elements equals any 
fixed set S .) An algorithm that is instance-optimal in the random-order setting can deal with all probability 
distributions at the same time! Random-order instance optimality also remedies a common complaint about 
average-case analysis, that it does not provide information about an algorithm’s performance on a specific 
input. 

Convex hull: our main result. After making the case for instance-optimal algorithms under our definitions, 
the question remains: do such algorithms actually exist, or are they “too good to be true”? Specifically, 
we turn to one of the most fundamental and well known problems in computational geometry—computing 
the convex hull of a set of n points. Many 0(n log n)-time algorithms in 2-d and 3-d have been proposed 
since the 1970s [de Berg et al. 1997; Edelsbrunner 1987; Preparata and Sliamos 1985], which are worst- 
case optimal under the algebraic computation tree model. Optimal output-sensitive algorithms can solve 
the 2-d and 3-d problem in O(nlogh) time, where h is the output size. The first such output-sensitive 
algorithm in 2-d was found by [Kirkpatrick and Seidel 1986] in the 1980s and was later simplified by [Chan 
et al. 1997] and independently [Wenger 1997]; a different, simple, optimal output-sensitive algorithm was 
discovered by [Chan 1996b]. In 3-d, the first optimal output-sensitive algorithm was obtained by [Clarkson 
and Shor 1989] using randomization; another version was described by [Clarkson 1994]. The first deterministic 
optimal output-sensitive algorithm in 3-d was obtained by [Chazelle and Matousek 1995] via derandomization; 
the approach by [Chan 1996b] can also be extended to 3-d and yields a simpler optimal output-sensitive 
algorithm. There are also average-case algorithms running in 0(n) expected time for certain probability 
distributions [Preparata and Shamos 1985], for example, when the points are independent and uniformly 
distributed inside a circle or a constant-sized polygon in 2-d, or a ball or a constant-sized polyhedron in 3-d. 

The convex hull problem is in some ways an ideal candidate to consider in our models. It is not difficult 
to think of examples of “easy” point sets and “hard” point sets (see Figure l(a,b)). It is not difficult to 
think of different heuristics for pruning nonextreme points, which may not necessarily improve worst-case 
complexity but may help for many point sets encountered “in practice” (e.g., consider quickhull [Preparata 
and Shamos 1985]). However, it is unclear whether there is a single pruning strategy that works best on all 
point sets. 

In this paper, we show that there are indeed instance-optimal algorithms for both the 2-d and 3-d convex 
hull problem, in the order-oblivious or the stronger random-order setting. Our algorithms thus subsume all 
the previous output-sensitive and average-case algorithms simultaneously, and are provably at least as good 
asymptotically as any other algorithm for every point set, so long as input order is ignored. 


2 One can also consider other variations of the definition, e.g., relaxing the condition to T^ vg (S) < 0(1) ■ OPT avg (S), or replacing 
expected running time over random permutations with analogous high-probability statements. 
























Techniques. We believe that our techniques—for both the upper-bound side (i.e., algorithms) and the 
lower-bound side (i.e., proofs of their instance optimality)—are as interesting as our results. 

On the upper-bound side, we find that in the 2-d case, a new algorithm is not necessary: a version of 
Kirkpatrick and Seidel’s output-sensitive algorithm [Kirkpatrick and Seidel 1986], or its simplification by 
[Chan et al. 1997], is instance-optimal in the order-oblivious and random-order setting. We view this as a 
plus: these algorithms are simple and practical to implement [Bhattacharya and Sen 1997], and our analysis 
sheds new light on their theoretical complexity. In particular, our result immediately implies that a version 
of Kirkpatrick and Seidel’s algorithm runs in 0{n) expected time for points uniformly distributed inside a 
circle or a fixed-size polygon—we were unaware of this fact before. (As another plus, our result provides a 
positive answer to the question in the title of Kirkpatrick and Seidel’s paper, “The ultimate planar convex 
hull algorithm?”) 

In 3-d we propose a new algorithm, as none of the previous output-sensitive algorithms seems to be 
instance-optimal. For example, known 3-d generalizations of the Kirkpatrick-Seidel algorithm have subopti- 
mal 0(n log 2 h) running time [Chan et al. 1997; Edelsbrunner and Shi 1990], while a straightforward imple¬ 
mentation of the algorithm by [Chan 1996b] fails to be instance-optimal even in 2-d. Our algorithm builds 
on Chan’s technique [Chan 1996b] but requires additional ideas, notably the use of partition trees [Matousek 
1992], 

The lower-bound side requires more innovation. We are aware of three existing techniques for proving worst- 
case 52(?rlogn) (or output-sensitive tt(n\ogh)) lower bounds in computational geometry: (i) information- 
theoretic or counting arguments, (ii) topological arguments, from early work by [Yao 1981] to [Ben-Or 1983], 
and (iii) Ramsey-theory-based arguments, by [Moran et al. 1985]. Ben-Or’s approach is perhaps the most 
powerful and works in the general algebraic computation tree model, whereas Moran et al.’s approach works 
in a decision tree model where all the test functions have a bounded number of arguments. For an arbitrary 
input set S for the convex hull problem, the naive information-theoretic argument gives only an fl(hlogh) 
lower bound on OPT(5'). On the other hand, topological and Ramsey-theory approaches seem unable to give 
any instance-specific lower bound (for example, modifying the topological approach is already nontrivial if 
we just want a lower bound for some integer input set [Yao 1991], let alone for every input set, whereas the 
Ramsey-theory approach considers only input elements that come from a cleverly designed subdomain). 

We end up using a different lower bound technique which is inspired by an adversary argument originally 
used to prove time-space lower bounds for median finding [Chan 2010]. Note that this approach can lead to 
another proof of the standard fi(nlog?r) lower bounds for many geometric problems including the problem 
of computing a convex hull; the proof is simple and works in any algebraic decision tree model where the 
test functions have at most constant degree and have at most a constant number of arguments. We build on 
the idea further and obtain an optimal lower bound for the convex hull problem for every input point set. 
The assumed model is more restrictive: the class A of allowed algorithms consists of those under a decision 
tree model in which the test functions are multilinear and have at most a constant number of arguments. 
Fortunately, most standard primitive operations encountered in existing convex hull algorithms satisfy the 
multilinearity condition (for example, the standard determinant test does). The final proof interestingly 
involves partition trees [Matousek 1992], which are more typically used in algorithms (as in our new 3-d 
algorithm) rather than in lower-bound proofs. 

So, what is OPT(S'), i.e., what parameter asymptotically captures the true difficulty of a point set S for 
the convex hull problem? As it turns out, the bound has a simple expression (to be revealed in Section 3) 
and shares similarity with entropy bounds found in average-case or expected-case analysis of geometric data 
structures where query points come from a given probability distribution these distribution-sensitive results 
have been the subject of several previous pieces of work [Arya et al. 2007a; Arya et al. 2007b; Collette et al. 
2012; Dujmovic et al. 2012; Iacono 2004]. However, lower bounds for distribution-sensitive data structures 
cannot be applied to our problem because our problem is off-line (lower bounds for on-line query problems 
usually assume that the query algorithms fit a “classification tree” framework, but an off-line algorithm may 
compare a query point not only with points from the data set but also with other query points). Furthermore, 
although in the off-line setting we can think of the query points as coming from a discrete point probability 
distribution, this distribution is not known in advance. 3 Lastly, distribution-sensitive data structures are 
usually concerned with improving the query time, but not the total time that includes preprocessing. 


3 Self-improving algorithms [Ailon et al. 2011] also cope with the issue of how to deal with unknown input probability distribu¬ 
tions, but are not directly comparable with our results, since in their setting each point can come from a different distribution, 
so input order matters. 



Other results. The computation of the convex hull is just one problem for which we are able to obtain 
instance optimality. We show that our techniques can lead to instance-optimal results for many other standard 
problems in computational geometry, in the order-oblivious or random-order setting, including: 

(a) maxima in 2-d and 3-d; 

(b) reporting/counting intersection between horizontal and vertical line segments in 2-d; 

(c) reporting/counting pairs of Loo-distance at most 1 between a red point set and a blue point set in 2-d; 

(d) off-line orthogonal range reporting/counting in 2-d; 

(e) off-line dominating reporting in 2-d and 3-d; 

(f) off-line halfspace range reporting in 2-d and 3-d; and 

(g) off-line point location in 2-d. 

Optimal expected-case, entropy-based data structures for the on-line version of (g) are known [Arya et al. 
2007b; Iacono 2004], but not for (e,f)- -for example, [Dujmovic et al. 2012] only obtained results for 2-d 
dominance counting, a special case of 2-d orthogonal range counting. Incidentally, as a consequence of our 
ideas, we can also get new optimal expected-case data structures for on-line 2-d general orthogonal range 
counting and 2-d and 3-d halfspace range reporting. 

Related work. [Fagin et al. 2003] first coined the term “instance optimality” (when studying the problem 
of finding items with the k top aggregate scores in a database in a certain model), although some form of 
the concept has appeared before. For example, the well known “dynamic optimality conjecture” is about 
instance optimality concerning algorithms for manipulating binary search trees (see [Demaine et al. 2009] 
for the latest in a series of papers). [Demaine et al. 2000] studied the problem of computing the union or 
intersection of k sorted sets and gave instance-optimal results for any k for union, and for constant k for 
intersection, in the comparison model; [Barbay and Chen 2008] extended their result to the computation 
of convex hull of k convex polygons in 2-d for constant k. Another work about instance-optimal geometric 
algorithms is by [Baran and Demaine 2005], who addressed an approximation problem about computing the 
distance of a point to a curve under a certain black-box model. Other than these, there has not been much 
work on instance optimality in computational geometry, especially concerning the classical problems under 
conventional models. 

The concept of instance optimality resembles competitive analysis of on-line algorithms. In fact, in the 
on-line algorithms literature, our order-oblivious setting of instance optimality is related to what [Boyar 
and Favrholdt 2007] called the relative worst order ratio, and our random-order setting is related to what 
[Kenyon 1996] called the random order ratio. What makes instance optimality more intriguing is that we are 
not bounding the objective function of an optimization problem, but rather the running time of an algorithm. 

2. WARM-UP: 2-D MAXIMA 

Before proving our main result on convex hull, we find it useful to study a simpler problem: maxima in 2-d. 
For two points p and q we say p dominates q if each coordinate of p is greater than that the corresponding 
coordinate of q. Given a set S of n points in R d , a point p is maximal if p £ S and p is not dominated by any 
other point in S. For simplicity, we assume that the input is always nondegenerate throughout the paper (for 
example, no two points share the same x- or y-coordinate). The maxima problem is to report all maximal 
points. 

For an alternative formulation, we can define the orthant. at a point p to be the region of all points that 
are dominated by p. In 2-d, the boundary of the union of the orthants at all p € S forms a staircase , and 
the maxima problem is equivalent to computing the staircase of S. 

This problem has a similar history as the convex hull problem: many worst-case 0(n log n)-time algorithms 
are known; an output-sensitive algorithm by [Kirkpatrick and Seidel 1985] runs in 0(n log h ) time for output 
size h: and average-case algorithms with 0(n) expected time have been analyzed for various probability 
distributions [Bentley et al. 1990; Clarkson 1994; Preparata and Shamos 1985]. The problem is simpler than 
computing the convex hull, in the sense that direct pairwise comparisons are sufficient. We therefore work 
with the class A of algorithms in the comparison model where we can access the input points only through 
comparisons of the coordinate of an input point with the corresponding coordinate of another input point. 
The number of comparisons made by an algorithm yields a lower bound on the running time. 

We define a measure H(S) to represent the difficulty of a point set S and prove that the optimal running 
time OPT(S') is precisely 0(n('H(S') + 1)) for the 2-d maxima problem in the order-oblivious and random- 
order setting. 
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Fig. 2. Three respectful partitions of an instance of the 2-d maxima problem. The two partitions on the left have entropy 
j 2 log 12 + yjj- log -y- + log -y « 1.281. The partition II ver t on the right has higher entropy log 3 ~ 1.585. 


Definition 2.1. Consider a partition II ol the input set S into disjoint subsets S\,...,St. We say that 
II is respectful if each subset Sk is either a singleton or can be enclosed by an axis-aligned box Bk 
whose interior is completely below the staircase of S. Define the entropy H(II) of the partition II to be 
Tfc=i(l‘5'fel/ rl ) log(n/|iSfc|). Define the structural entropy 'H(S) of the input set S to be the minimum of 'H(II) 
over all respectful partitions II of S. 


Remark 2.2. Alternatively, we could further insist in the definition that the bounding boxes Bi are 
nonoverlapping and cover precisely the staircase of S. However, this will not matter, as it turns out that 
the two definitions yield asymptotically the same quantity (this nonobvious fact is a byproduct of our lower 
bound proof in Section 2.2). 

Entropy-like expressions similar to 'H(n) have appeared in the analysis of expected-case geometric data 
structures for the case of a discrete point probability distribution, although our definition itself is nonproba- 
bilistic. A measure proposed by [Sen and Gupta 1999] is identical to "H^vert) where n vert is a partition of S 
obtained by dividing the point set S by h vertical lines at the h maximal points of S (see Figure 2(right) for 
an illustration). Note that H(n vert ) is at most log/i (see Figure 1(c)) but can be much smaller; in turn, 'H(S') 
can be much smaller than 'H(n ver t) (see Figures 1(d) and 2). The complexity of the 1-d multiset sorting 
problem [Munro and Spira 1976] also involves an entropy expression associated with one partition, but does 
not require taking the minimum over multiple partitions. 

2.1. Upper bound 

We use a slight variant of Kirkpatrick and Seidel’s output-sensitive maxima algorithm [Kirkpatrick and Seidel 
1985] (in their original algorithm, only points from Qi are pruned in line 4): 

maxima2d(Q): 

1. if |Q| = 1 then return Q 

2. divide Q into the left and right halves Qe and Q r by the median ^-coordinate 

3. discover the point q with the maximum y-coordinate in Q r 

4. prune all points in Qg and Q r that are dominated by q 

5. return the concatenation of maxima2d(Q£) and maxima2d(<5 T .) 

We call maxima2d(S') to start: Figure 3 illustrates the state of the algorithm after a single recursion level. 
Kirkpatrick and Seidel showed that its running time is within 0(n log h), and [Sen and Gupta 1999] improved 
this upper bound to 0(n(7I(n ver t) +1)). Improving this bound to 0(n(H(Il) +1)) for an arbitrary respectful 
partition n of S requires a bit more finesse: 

Theorem 2.3. Algorithm. maxima2d(S') runs in 0(n('H(S) + 1)) time. 

Proof. Consider the recursion tree of the algorithm and let Xj denote the sublist of all maximal points 
of S discovered during the first j recursion levels, in left-to-right order. Let S^~> be the subset of points of S 
that survive recursion level j, i.e., that have not been pruned during levels 0,..., j of the recursion, and let 
nj = |S< J ')|. The running time is asymptotically bounded by n i- Observe that 

(i) there can be at most [n/2- 7 ] points of with x-coordinates between any two consecutive points in 
Xj , and 

(ii) all points of S that are strictly below the staircase of Xj have been pruned during levels 0,..., j of the 
recursion. 
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Fig. 3. Partial execution of maxima(.S') after one 
recursion level. In this example, after computing 
the median ^-coordinate, the algorithm found the 
highest point q 2 to the right of the median, and 
pruned the 6 points dominated by it. Only 5 points 
are left to recurse upon, 1 to the left and 4 to the 
right. 


Let II be any respectful partition of S. Consider a subset Sk in II. Let Bk be a box enclosing Sk whose 
interior lies below the staircase of S. Fix a level j. Suppose that the upper-right corner of Bk has cc-coordinate 
between two consecutive points qi and qi+i in Xj. By (ii), the only points in Bk that can survive level j 
must have ^-coordinates between qi and qi+i- Thus, by (i), the number of points in Sk that survive level j 
is at most min {|5fe|, [n/2- 7 ]}. Since the SVs cover the entire point set, with a double summation we have 

[log n\ 

< y 5Z min {i s ' fc i’ \ n / 2 ^} 

i=o fc 

[log n] 

= Y min {i-s'feU r^/2- 7 !} 

fc j=0 

— y ' (l^fclriog(n/|S , fc|)] + ISfcl + |S , fc|/2 + |S fc |/4 + • • • + 1) 

fc 

< Y |5 fc |(flog(n/|5fc|)l +2) 

k 

G 0(n(H{ n) + l)). 

As II can be any respectful partition of 5, it can be in particular the one of minimum entropy, hence the 
final result. □ 

2.2. Lower bound 

For the lower-bound side, we first provide an intuitive justification for the bound £l(n('H(S)+l)) and point out 
the subtlety in obtaining a rigorous proof. Intuitively, to certify that we have a correct answer, the algorithm 
must gather evidence for each point p eliminated why it is not a maximal point, by indicating at least one 
witness point in S which dominates p. We can define a partition II by placing points with a common witness 
in the same subset. It is easy to see that this partition II is respectful. The entropy bound nH(Tl) roughly 
represents the number of bits required to encode the partition II, so in a vague sense, nH(S) represents the 
length of the shortest “certificate” for S. Unfortunately, there could be many valid certificates for a given 
input set S (due to possibly multiple choices of witnesses for each nonmaximal point). If hypothetically all 
branches of an algorithm lead to a common partition II, then a straightforward information-theoretic or 
counting argument would indeed prove the lower bound. The problem is that each leaf of the decision tree 
may give rise to a different partition II. 

In Appendix A, we show that despite the aforementioned difficulty, it is possible to obtain a proof of 
instance optimality via this approach, but the proof requires a more sophisticated counting argument, and 
also works with a different difficulty measure. Moreover, it is limited specifically to the 2-d maxima problem 
and does not extend to 3-d maxima, let alone to nonorthogonal problems such as the convex hull problem. 

In this subsection, we describe a different proof, which generalizes to the other problems that we consider. 
The proof is based on an interesting adversary argument. We show in Section 4 how to adapt the proof to 
the random-order setting. 

Theorem 2.4. OPT(S') G f t{n(T-L{S) + 1)) for the 2-d maxima problem in the comparison model. 
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Fig. 4. The difficulty of proving a lower bound by counting certificates: each elimination of a point p must be justified by 
indicating a witness which dominates p. But there can be more than one such certificate for each instance, and some can be 
encoded in less space than others. Here the same instance is partitioned on the left into 3 subsets of equal size 4, while it is 
partitioned on the right into subsets of sizes 1, 7, 4 with lower entropy. 
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Fig. 5. The beginning of the recursive partitioning of S by the k-d tree 7”, which will yield the final partition n^d-tree for the 
adversarial lower bound for the 2-d maxima problem. The two bottom boxes are already leaves, while the two top boxes will 
be divided further. 


Proof. We prove that a specific respectful partition described below not only asymptotically achieves the 
minimum entropy among all the respectful partitions, but also provides a lower bound for the running time 
of any comparison-based algorithm that solves the 2-d maxima problem. The construction of the partition 
is based on k-d trees [de Berg et al. 1997]. We define a tree T of axis-aligned boxes, generated top-down as 
follows: The root stores the entire plane. For each node storing box B, if B is strictly below the staircase 
of S , or if B contains just one point of S, then B is a leaf. Otherwise, if the node is at an odd (resp. even) 
depth, divide B into two subboxes by the median x-coordinate (resp. y-coordinate) among the points of S 
inside B. The two subboxes are the children of B (see Figure 5 for an illustration). Note that each box B at 
depth j of T contains at least \n/2^\ points of S , and consequently, the depth j is in fi(log(n/|iS 0 B\)). 

Our claimed partition, denoted by Ilkd-tree > is one formed by the leaf boxes in this tree T (i.e., points 
in the same leaf box are placed in the same subset). Clearly, Ilkd-tree is respectful. We will prove that for 
any correct algorithm A in the comparison model, there exists a permutation of S on which the algorithm 
requires at least Q(riH (Ilkd-tree)) comparisons. 

The adversary constructs a bad permutation for the input by simulating the algorithm A and resolving 
each comparison so that the algorithm is forced to perform many others. During the simulation, we maintain 
a box B p in T for each point p. If B p is a leaf box, the algorithm knows the exact location of p inside B p . 
But if B p corresponds to an internal node, the only information the algorithm knows about p is that p lies 
inside B p . In other words, p can be assigned any point in B p without affecting the outcomes of the previous 
comparisons made. 

For each box B in T, let n(B) be the number of points p such that the box B p is contained in B. We 
maintain the invariant that n(B) < |S'n B\. If n(B) = |5fl B\, we say that B is full. As soon as B p becomes 
a leaf box, we assign p to an arbitrary point in S fl B p that has not been previously assigned (such a point 
exists because of the invariant); we then call p a fixed point. 






























Suppose that the algorithm A compares, say, the ^-coordinates of two points p and q. The main case is 
when neither B p nor B q is a leaf. The comparison is resolved in the following way: 

(1) If B. p (resp. Bq) is at even depth, we arbitrarily reset B p (resp. B q ) to one of its children that is not full. 
Thus assume that B p and B q are both at odd depths. 

Without loss of generality, suppose that the median ^-coordinate of B p is less than the median x- 
coordinate of B q . We reset B p to the left child B' of B p and B q to the right child B' of B q \ if either B' p 
or B q is full, we go to step 2. Now, the knowledge that p and q lie in B' and B' allows us to deduce that 
p has a smaller ^-coordinate than q. Thus, the adversary declares to the algorithm that the ^-coordinate 
of p is smaller than that of q and continues with the rest of the simulation. 

(2) An exceptional case occurs if B p is full (or similarly B q is full). Here, we reset B p instead to the left 
(resp. right) sibling B p of B', but the comparison is not necessarily resolved yet, so we go back to step 
1. 

Note that in both steps the invariant is maintained. This is because B p and B p cannot be both full: 
otherwise, we would have | S D B p \ = IS 1 (~l B p \ + |S D B p \ = n(B p ) + n(B "), but | S fl B p \ > n(B p ) > 
n(B') + n{B p ) + 1 (the “+1” arises because at least one point, notably, p, has B p as its box). 

Tne above description can be easily modified in the case when B p or B q is a leaf box. If both B p and 
B q are leaf boxes, then p and q are already fixed and the comparison is already resolved. If (without loss of 
generality) only B p is a leaf, we follow step 1 except that now since p has been fixed, we compare the actual 
^-coordinate of p to the median x-coordinate of B q , and reset only B q . 

We now prove a lower bound on the number of comparisons, T, made by the algorithm A. Let D be the 
sum of the depth of the boxes B p in the tree T over all points p £ S at the end of the simulation of the 
algorithm. We will lower-bound T in terms of D. Each time we reset a box to one of its children in step 1 
or 2, D is incremented; we say that an ordinary (resp. exceptional) increment occurs at the parent box if 
this is done in step 1 (resp. step 2). Each comparison generates only 0(1) ordinary increments. To account 
for exceptional increments, we use a simple amortization argument: At each box B in T , the number of 
ordinary increments has to reach at least LIS' fl B|/2J first, before any exceptional increments can occur, 
and the number of exceptional increments is at most [|S fl B |/2]. Thus, the total number of exceptional 
increments can be upper-bounded by the total number of ordinary increments, which is in 0(T). It follows 
that D £ 0(T), i.e., T £ 0(D). 

Thus, it remains to prove our lower bound for D. We first argue that at the end of the simulation, B q 
must be a leaf box for every point q £ S. Suppose that this is not the case. After the end of the simulation, 
we can do the following postprocessing: for every point p where B p corresponds to an internal node, we reset 
B p to one of its nonfull children arbitrarily, and repeat. As a result, every B p now becomes a leaf box, all 
the input points have been assigned to points of 5, and no two input points are assigned the same value, i.e., 
the input is fixed to a permutation of S. The staircase of this input obviously coincides with the staircase 
of S. Next, consider modifying this input slightly as follows. Suppose that B q was not a leaf box before the 
postprocessing. Then this box contained at least two points of S and was not completely underneath the 
staircase of S. We can either move a nonmaximal point upward or a maximal point downward inside B q , 
and obtain a modified input that is consistent with the comparisons made but has a different set of maximal 
points. The algorithm would be incorrect on this modified input: a contradiction. 

It follows that 


T £ f 1(D) 

c |SnB|k>g(n/|SnB|) j 

Vleaf B J 

C fl(?r"H(nkd-tree)) 

C n(riH(S)). 

Combined with the trivial f2(n) lower bound, this establishes the f l(n('H(S) + 1)) lower bound. □ 


Remark 2.5. The above proof is inspired by an adversary argument described by [Chan 2010] for a 1-d 
problem (the original proof maintains a dyadic interval for each input point, while the new proof maintains a 



box from a hierarchical subdivision ). 4 The proof still holds for weaker versions of the problem, for example, 
reporting just the number of maximal points (or the parity of the number). The lower-bound proof easily 
extends to any constant dimension and can be easily modified to allow comparisons of different coordinates 
of any two points p = (xi,..., xfi) and q = (x(,... ,x' d ), for example, testing whether Xj < x', or even 
Xi < x'j + a for any constant a. (For a still wider class of test functions, see the next section.) 

3. CONVEX HULL 

We now turn to our main result on 2-d and 3-d convex hull. It suffices to consider the problem of computing 
the upper hull of an input point set S in R d (d £ {2, 3}), since the lower hull can be computed by running 
the upper hull algorithm on a reflection of S. (Up to constant factors, the optimal running time for convex 
hull is equal to the maximum of the optimal running time for upper hull and the optimal running time for 
lower hull, on every input.) 

We work with the class A of algorithms in a multilinear decision tree model where we can access the input 
points only through tests of the form f(p ±,... ,p c ) > 0 for a multilinear function /, over a constant number 
of input points pi,... ,p c . We recall the following standard definition: 

Definition 3.1. A function / : (R d ) c —> R is multilinear if the restriction of / is a linear func¬ 
tion from R d to R when any c — 1 of the c arguments are fixed. Equivalently, / is multilinear if 
/((xn,..., xid),..., (x c i,... ,x c d)) is a multivariate polynomial function in which each monomial has the 
form Xj Ul • • • Xi k j k where ii ,..., ik are all distinct (i.e., we cannot multiply coordinates from the same point). 

Most of the 2-d and 3-d convex hull algorithms that we know fit in this framework: it supports the standard 
determinant test (for deciding whether pi is above the line through P 2 ,P 3 , or the plane through P 2 ,P 3 ,P 4 ), 
since the determinant is a multilinear function. For another example, in 2-d, comparison of the slope of 
the line through p±,p 2 with the slope of the line through P 3 ,p± reduces to testing the sign of the function 
(jj 2 ~ U i)(x 4 — X 3 ) — (X 2 — Xi)(y 4 — 2 / 3 ), which is clearly multilinear. We discuss in Section 5 the relevance 
and limitations of the multilinear model. 

We adopt the following modified definition of ’H(S') (as before, it does not matter whether we insist that 
the simplices A& below are nonoverlapping for both the 2-d and 3-d problem): 

Definition 3.2. A partition II of S is respectful if each subset Sk in II is either a singleton or can be enclosed 
by a simplex A*, whose interior is completely below the upper hull of S. Define the structural entropy H(S) 
of S to be the minimum of ’H(II) = X^fc(l‘^ fe l/ n ) log(n/|Sfc|) over respectful partitions II of S. 

3.1. Lower bound 

The lower-bound proof for computing the convex hull builds on the corresponding lower-bound proof for 
computing the maxima from Section 2.2 but is more involved, because a k-d tree construction no longer 
suffices when addressing nonorthogonal problems. Instead, we use the known following lemma: 

Lemma 3.3. For every set Q of n points in R d and 1 < r < n for any constant d, we can partition Q 
into r subsets Qi, ■. ■ ,Q r each of size ®(n/r) and find r convex polyhedral cells 71 ,... ,7 r each with 0 (log r) 
(or fewer) facets, such that Qi is contained in 7 i, and every hyperplane intersects at most 0 (r 1-e ) cells. 
Here, e > 0 is a constant that depends only on d. 

The above follows from the partition theorem of [Matousek 1992], who obtained the best constant e = 1/d ; 
in his construction, the cells 7 * are simplices (with 0 ( 1 ) facets) and may overlap, and subset sizes are 
indeed upper- and lower-bounded by ©(n/r). (A version of the partition theorem by [Chan 2012] can avoid 
overlapping cells, but does not guarantee an 0 (n/r) lower bound on the subset sizes.) 

In 2-d or 3-d, a more elementary alternative construction follows from the 4-sectioning or 8 -sectioning 
theorem [Edelsbrunner 1987; Yao et al. 1989]: for every n-point set Q in R 2 , there exist 2 lines that divide 
the plane into 4 regions each with n/4 points; for every n-point set Q in R 3 , there exist 3 planes that divide 
space into 8 regions each with n /8 points. Since in R 2 a line can intersect at most 3 of the 4 regions and 
in R 3 a plane can intersect at most 7 of the 8 regions, a simple recursive application of the theorem yields 
e = 1 — log 4 3 for d = 2 and e = 1 — log 8 7 for d = 3. Each resulting cell 7 * is a convex polytope with 0(log r) 
facets, and the cells do not overlap. 

We also need another fact, a geometric property about multilinear functions: 


4 There are also some indirect similarities to an adversary argument for sorting due to [Kahn and Kim 1995], as pointed out to 
us by Jean Cardinal (personal communication, 2010). 



Lemma 3.4. If f : (R d ) c —> R is multilinear and has a zero in 71 x • • • x j c where each ji is a convex 
polytope in R d , then f has a zero (pi ,... ,p c ) S 71 x • • • x 7 C such that all but at most one point pi is a 
polytope’s vertex. 

Proof. Let (p \,... ,p c ) £ 7 i x ■ ■ • x 7 c be a zero of /. Suppose that some p t does not lie on an edge 
of 7 j. If we fix the other c — 1 points, the equation / = 0 becomes a hyperplane, which intersects 7 , and 
thus must intersect an edge of 7 *. We can move pi to such an intersection point. Repeating this process, 
we may assume that every pi lies on an edge upai of 7 j. Represent the line segment parametrically as 
{(1 - U)ui + Uni | 0 < U < 1 }. 

Next, suppose that some two points pi and pj are not vertices. If we fix the other c — 2 points and restrict 
Pi and pj to he on upUi and ujvj respectively, the equation / = 0 becomes a multilinear function in two 
parameters ti,tj £ [0,1]. The equation has the form atitj + a'ti + a"tj + a'" = 0 and is a hyperbola, which 
intersects [0, l ] 2 and must thus intersect the boundary of [0, l] 2 . We can move pi and pj to correspond to 
such a boundary intersection point. Then one of pi and pj is now a vertex. Repeating this process, we obtain 
the lemma. □ 

We are now ready for the main lower-bound proof: 

Theorem 3.5. OPT(S') e fl(n(TL(S) + 1)) for the upper hull problem in any constant dimension d in 
the multilinear decision tree model. 

Proof. We define a partition tree T as follows: Each node v stores a pair (Q(t;), T(i>)), where Q(v) is a 
subset of S enclosed inside a convex polyhedral cell T(t>). The root stores (S, R d ). If T(t;) is strictly below 
the upper hull of S , or if |Q(c)| drops below a constant, then v is a leaf. Otherwise, apply Lemma 3.3 with 
r = b to partition Q{v ) and obtain subsets Qi,... ,Qb and cells 71 ,..., 7 b . For the children v\,...,Vb of v, 
set Q(vi ) = Qi and r(i 7 ) = 7 * fir(r). For a node v at depth j of the tree T we then have |Q(v)| > n/0(6) J , 
and consequently, the depth j is in fl(log b (n/|Q(c)|)). Furthermore, since r(u) is the intersection of at most 
j convex polyhedra with at most 0 (log 6 ) facets each, it has size (jlogfo) 0 ^. 

Let n part _ tree be the partition formed by the subsets Q(v) at the leaves v in T. Let n part _ tree be a refinement 
of this partition obtained as follows: for each leaf v at depth j, we triangulate T(v) into (j log b)° C) simplices 
and subpartition Q(v) by placing points of Q(v) from the same simplex in the same subset; if |Q(f)| drops 
below a constant, we subpartition Q(v) into singletons. Note that the subpartitioning of Q(v) causes the 
entropy to increase 5 by at most 0((\Q(v)\/n) log(j log b)) C 0((\Q(v)\/n) log log(n/|Q(u)|)) for any constant 
b. The total increase in entropy is thus within 0('H(n part _tree))- So 'H(n part -tree) £ ©(^(Ifpart-tree))- Clearly, 
n palt -tiee is respectful. 

The adversary constructs a bad permutation for the input points as follows. During the simulation, we 
maintain a node v p in 7" for each point p. If v p is a leaf, the algorithm knows the exact location of p inside 
r(i>p). But if v p is an internal node, the only information the algorithm knows about p is that p lies inside 

r M- 

For each node v in T, let n(v ) be the number of points p with v p in the subtree rooted at v. We maintain 
the invariant that n(v) < |Q(c)|. If n(v) = |Q(t;)|, we say that v is full. As soon as v p becomes a leaf, we fix 
p to an arbitrary unassigned point in Q{v p ) (such a point exists because of the invariant). 

Suppose that the simulation encounters a test “/(pi,... ,p c ) > 0?”. The main case is when none of the 
nodes v Pi is a leaf. 

(1) Consider a c-tuple (v' pi ,...,v' Pc ) where v p . is a child of v Pi . We say that the tuple is bad if / has a zero in 
p ( v Pl ) x • • • x 7 (v' Pc ), and good otherwise. We prove the existence of a good tuple by upper-bounding the 
number of bad tuples: If we fix all but one point pi, the restriction of / can have a zero in at most 0(6 1_£ ) 
cells of the form 7 (t; pi ), by Lemma 3.3 and the multilinearity of f. There are 0(5 c_ 1 log6) choices of 
c — 1 vertices of the cells of the form 'y(v pi ),..., j(v' Pc ). By Lemma 3.4, it follows that the number of bad 
tuples is at most 0((b c ~ 1 log b) ■ 6 1-£ ) C o{b c ). As the number of tuples is in 0(6 C ), if b is a sufficiently 
large constant, then we can guarantee that some tuple ( v pi ,..., v' Pc ) is good. We reset v Pi to v p . for each 
i = 1,..., c; if some v' p . is full, we go to step 2. Since the tuple is good, the sign of / is determined and 
the comparison is resolved. 


® If Ei= 1 Qi = 9 - then by concavity of the logarithm, J2i=i % lo § ^7 = f Ei=i ^ lo S 57 < ^ log = f lo S 7 + ^ lo § k ■ 



Fig. 6. Kirkpatrick and Seidel’s upper hull algorithm [Kirkpatrick and Seidel 1986] with an added pruning step. Line 5 prunes 
the points in the shaded trapezoid. The added step in line 2 prunes points in the two shaded triangles. 


(2) In the exceptional case when some v p . is full, we reset v Pi instead to an arbitrary nonfull child, and go 
back to step 1. 

The above description can be easily modified in the case when some of the nodes v Pi are leaves, 
some of the points pi are already fixed (we just have to lower c by the number of fixed points). 

Let T be the number of tests made. Let D be the sum of the depth of v p over all points p G S. 
amortization argument as in the previous proof of Theorem 2.4 proves that T € Q(D). By an 
similar to before, at the end of the simulation, v p must be a leaf for every p € S. It follows that 

t e n(D) 

c fi ( IQ(w)l lo g(n/|<9(w)|) 

Vleaf v 

C Q{jlhL (IIp ar t-tree)) 

C £l(nH (11 part-tree ) ) 

c £l(riH(S)). 

Combined with the trivial f l(n) lower bound, this establishes the theorem. □ 

The proof extends to weaker versions of the problem, for example, reporting the number of hull vertices 
(or its parity). 

3.2. Upper bound in 2-d 

To establish a matching upper bound in 2-d, we use a version of the output-sensitive convex hull algorithm by 
[Kirkpatrick and Seidel 1986] described below, where an extra priming step is added in line 2. (This step is not 
new and has appeared in both quickhull [Preparata and Shamos 1985] and the simplified output-sensitive 
algorithm by [Chan et al. 1997]; see Figure 6 for illustration.) 

hull2d(Q): 

1 . if |Q| = 2 then return <5 

2. prune all points from Q strictly below the line through the leftmost and 
rightmost points of Q 

3. divide Q into the left and right halves Qi and Q r by the median ^-coordinate p m 

4. discover points q , q' that define the upper-hull edge qq' intersecting the vertical 
line at p m (in linear time) 

5. prune all points from Qt and Q r that are strictly underneath the line segment qq' 

6 . return the concatenation of hull2d(<5^) and hull2d(Q r ) 

Line 4 can be done in 0(n) time (without knowing the upper hull beforehand) by applying a 2-d linear 
programming algorithm in the dual [Preparata and Shamos 1985]. We call hull2d(5') to start. It is straight¬ 
forward to show that the algorithm, even without line 2, runs in time 0(n log h), or 0(n(TL(Jl V ert ) + 1)) for 
the specific partition II vert of S obtained by placing points underneath the same upper-hull edge in the same 
subset, as was done by [Sen and Gupta 1999]. To upper-bound the running time by 0(n(H(J\) + 1)) for an 
arbitrary respectful partition II of S, we modify the proof in Theorem 2.3: 


i.e., when 

The same 
argument 






Theorem 3.6. Algorithm lmll2d(S) runs in 0(n('H(S) + 1)) time. 

Proof. Like before, let Xj denote the sublist of all hull vertices discovered during the first j levels of 
the recursion, in left-to-right order. Let be the subset of points of S that survive recursion level j, and 
rij = 15^^. The running time is asymptotically bounded by n i- Observe that 

(i) there can be at most \n/2Q points of with ^-coordinates between any two consecutive vertices in 
Xj, and 

(ii) all points that are strictly below the upper hull of Xj have been pruned during levels 0 ,..., j of the 
recursion. 

Let II be any respectful partition of S. Consider a subset Sk in II. Let A k be a triangle enclosing Sk whose 
interior lies below the upper hull of S. Fix a level j. If qi and qi+i are two consecutive vertices in Xj such 
that qiqi+i does not intersect the boundary of A k (i.e., is above A*,), then all points in A*, with ^-coordinates 
between qk and qk+i would have been pruned during the first j levels by (ii). Since only 0(1) edges qiqi+i 
of the upper hull of Xj can intersect the boundary of A&, the number of points in Sk that survive level j is 
at most min {\Sk\,0{n/2 : >)} by (i). We then have 

[log n] [log n] 

n i e min {|S'fc|, 0(n/2 J )} C + 1)) 

j=0 j—0 k 

as before. □ 

Remark 3.7. The same result holds for the simplified output-sensitive algorithm by [Chan et al. 1997], 
which avoids the need to invoke a 2-d linear programming algorithm. (Chan et al.’s paper explicitly added 
the pruning step in their algorithm description.) The only difference in the above analysis is that there can 
be at most |~(3/4) J n] points of S with ^-coordinates between any two consecutive vertices in Xj. 

3.3. Upper bound in 3-d 

We next present an instance-optimal algorithm in 3-d that matches our lower bound. Unlike in 2-d, it 
is unclear if any of the known algorithms can be modified for this purpose. For example, obtaining an 
0 (n('H(n vert ) + 1 )) upper bound is already nontrivial for the specific partition n ver t where points underneath 
the same upper-hull facet are placed in the same subset. Informed by our lower-bound proof, we suggest an 
algorithm that is also based on partition trees. We need the following subroutine: 

Lemma 3.8. Given a set ofn halfspaces in for any constant d, we can answer a sequence of r linear 
programming queries (finding the point that maximizes a query linear function over the intersection of the 
half spaces) in total time 0(nlogr + r°C^). 

The above lemma was obtained by Chan [Chan 1996b; 1996c] using a simple grouping trick (which was the 
basis of his output-sensitive 0(n log ft.)-time convex hull algorithm); the d > 3 case required randomization. A 
subsequent paper by [Chan 1996a] gave an alternative approach using a partition construction; this eliminated 
randomization. 

Our new upper hull algorithm can now be described as follows: 
hull3d(Q): 

1 . for j = 0,1,...., [log(<f log?i)J do 

2. partition Q by Lemma 3.3 to get rj = 2 2J subsets Q\, ..., Q rj and 
cells 7 i,... , 7 rj 

3. for each i = 1 to 77 do 

4. if 7 i is strictly below the upper hull of Q then prune all points in Qi from Q 

5. compute the upper hull of the remaining set Q directly 

Line 2 takes 0(|Q| log 77 + time by known algorithms for Matousek’s partition theorem [Matousek 

1992] (or alternatively recursive application of the 8 -sectioning theorem). The test in line 4 reduces to deciding 
whether each of the at most O (log 77 ) vertices of the convex polyhedral cell 7 ,; is strictly below the upper hull 
of Q. This can be done (without knowing the upper hull beforehand) by answering a 3-d linear programming 

query in dual space. Using Lemma 3.8, we can perform lines 3-4 collectively in time 0(|<5| log 77 + 77 *^). As 



Tj < n s , the r term is negligible, since its total over all iterations is sublinear in n by choosing a small 
constant S. Line 5 is done by running any 0(|Q| log |Q|)-time algorithm. 

Theorem 3.9. Algorithm hull3d(5) runs in 0(n('H(S) + 1)) time. 

Proof. Let nj be the size of Q just after iteration j. The total running time is asymptotically bounded 
by n j l°g r i+i- (This includes the cost of line 5, which is 0(nj log nj) € 0(nj log 77 + 1 ) for the last index 
j = |dog(d log n)].) 

Let II be any respectful partition of S. Consider a subset Sk in II. Let A*, be a simplex enclosing Sk 
whose interior lies below the upper hull of S. Fix an iteration j. Consider the subsets Q i,..., Q rj and cells 
7 i,..., 7 r . at this iteration. If a cell 7 is completely inside A*,, then all points inside 7 j are pruned. Since 
0(r^~ e ) cells 7 i intersect the boundary of A&, the number of points in Sk that remain in Q after iteration 
j is at most min { \Sk\, 0(r^ _e • n/rj )} = min {IS^I, 0(n/rj)}. The Sk s cover the entire point set, so with a 
double summation we have 

EVogrm < 0 (^St )}' 2J+1 

3 3 k 

k j 

^Y.°[ E i-w + E jin' 

k \j<log((l/e)log(n/|Sfe|)) + l j>log((l/e) log(n/|S fc |))+l 

e E°(l^l( lo sM^I) + i)) 

k 

€ 0(n{n(U) + l)), 
which yields the theorem. □ 

Remark 3.10. Variants of the algorithm are possible. For example, instead of recomputing the partition 
in line 3 at each iteration from scratch, another option is to build the partitions hierarchically as a tree. 
Points are pruned as the tree is generated level by level. 

One minor technicality is that the above description of the algorithm does not discuss the low-level test 
functions involved. In Section 5 we explain how a modification of the algorithm can indeed be implemented 
in the multilinear model. 

A similar approach works for the 3-d maxima problem in the comparison model. We just replace parti¬ 
tion trees with k-d trees, and replace linear programming queries with queries to test whether a point lies 
underneath the staircase, which can be done via an analog of Lemma 3.8. 

4. EXTENSION TO THE RANDOM-ORDER SETTING 

In this section, we describe how our lower-bound proofs in the order-oblivious setting can be adapted to 
the random-order setting. We focus on the convex hull problem and describe how to modify the proof of 
Theorem 3.5. We need a technical lemma first: 

Lemma 4.1. Suppose we place n random elements independently in t bins, where each element is placed 
in the k-th bin with probability nk/n. Then the probability that the k-th bin contains exactly nk elements for 
all k = 1 ,..., t is at least n~°^. 

Proof. The probability is exactly ni , n! . nt! (tt )” 1 ‘ " (if)™* > which by Stirling’s formula is 

_ 9(y / n)(n/e) 7t _ (0±) ni (0±\ nt c 1 

0 (\/nr)(?ri/e) ni • • • 0 (y / rq)(nt/e) n ‘ in/ in/ — ©(v^) 4-1 ’ 

yielding the result. □ 

We now present our lower-bound proof in the random-order setting. (The proof is loosely inspired by the 
randomized “bit-revealing” argument by [Chan 2010].) 



Theorem 4.2. OPT avg (S') G tt(n(TL(S) + 1)) for the upper hull problem in any constant dimension d in 
the multilinear decision tree model. 


PROOF. Fix a sufficiently small constant 5 > 0. Let T be as in the proof of Theorem 3.5, except that we 
keep only the first [<51ognJ levels of the tree, i.e., when a node reaches depth |_61ogn|, it is made a leaf. 

Let IIp ar t-tree be the partition of S formed by the leaf cells in T. Let n part -tree be a refinement of n part -tree in 
which each leaf cell is further triangulated and each subset corresponding to a cell of depth [6 log ?rj is further 
subpartitioned into singletons. Note that each such subset has size Q(n 5 ) and contributes 0((?i < 5 /n) logn) 

to both the entropy of Ii par t-t ree and 11 par t-t ree • Thus, Id. (Upai-t-tree) G 0(11 part-tree))• Clearly, n par t_tree is 
respectful. 

The adversary proceeds differently. We do not explicitly maintain the invariant that no node v is full. 
Whenever some v p first becomes a leaf, we assign p to a random point among the points in Q(v p ) that 
has previously not been assigned. If all points in Q(y p ) have in fact been assigned, we say that failure has 
occurred. 

Suppose that the simulation encounters a test “f{pi, ... ,p c ) > 0”. We do the following: 

— We reset each v Pi to one of its children at random, where each child v p . is chosen with probability 

\QW Pi )\/\Q(. v Pi)\ (which is in 0(1/6)). If the tuple (v' ,..., v' ) is good (as defined in the proof of 

Theorem 3.5), then the comparison is resolved. Otherwise, we repeat. 

Since we have shown that the number of bad tuples is in o( 6 °), the probability that the test is not resolved 
in one step is in o( 6 °) • 0(1/6)°, which can be made less than 1/2 for a sufficiently large constant b. The 
number of iterations per comparison is thus upper-bounded by a geometrically distributed random variable 
with mean 0 ( 1 ). 

Let T be the number of comparisons made. Let D be the sum of the depth of v p over all points p € S at the 
end of the simulation. Clearly, D is upper-bounded by the total number of iterations performed, which is at 
most a sum of T independent geometrically distributed random variables with mean 0(1). Let (*) be the event 
that D < c 0 T for a sufficiently large constant cq. By the Clrernoff bound, Pr[(*)] > 1 — 2~ n ^ > 1 — 

By the same argument as before, at the end of the simulation, v p must be a leaf for every p G S, assuming 
that failure has not occurred. 

Let (f) be the event that failure has not occurred. If (*) and (f) are both true, then 


T G n{D) 

E |Q(v)|log(n/|<9(u)|) 
Vleaf v 

f= - l{o,d~L (I [part-tree) ) 

G 1 KuH (I [ par t-tree ) ) 

C f 


To analyze Pr[(f)], consider the leaf v p that a point p ends up with after the simulation (regardless of 
whether failure has occurred). This is a random variable, which equals a fixed leaf v with probability \Q(y)\/n. 
Moreover, all these random variables are independent. Failure occurs if and only if for some leaf v, the number 
of Up’s that equal v is different from |Q(u)|. By Lemma 4.1, Pr[(j)] > n~°^ n \ since there are 0(n s ) leaves 
in T. It follows that 


Prjnot (*) | (f)] < 


Pr[not (*)] 
Pr(f) 


G 


2 —f2(n) 

j^—0(n s ) 


2 — G2(rz,) 


Finally, observe that Pr[(f) A (the input equals a)\ is the same for all fixed permutations a of S (the 

probability is exactly Jji ea f v (^ 7 T^) |Q(E)|! )• °th er words, conditioned to (f), the input is a random 

permutation of S, i.e., the adversary have not acted adversarily after all! It follows that T G Q,{riH{S)) with 
high probability for a random permutation of S. In particular, E[T] G O (n'H(S)) for a random permutation 
of S. □ 


Remark 4.3. Applying the same ideas to the proof of Theorem 2.4 shows that OPT avg (S') G £l(n(H(S) + 
1 )) for the maxima problem in the comparison model. 
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Fig. 7. An instance where nH(S) is no longer a lower bound if nonmultilinear tests are allowed. A circular disk covers all 
nonmaximal points underneath the staircase. An algorithm tailored to this instance can identify the n—h points that are inside 
the disk by O(n) nonmultilinear tests, then compute the staircase of the remaining h points in (J(h log h) time, and verify that 
the disk is underneath the staircase by 0(h) additional nonmultilinear tests. 


5. ON THE MULTILINEAR MODEL 

We remark that if nonmultilinear test functions are allowed, then nH(S) may no longer a valid instance- 
optimal lower bound under our definition of T~L{S). For example, one can design both an instance S of the 
2-d maxima problem with h output points, having %{S) £ 0(log/i) (see Figure 7), and an algorithm A that 
requires just Oin + h log h ) operations on that instance using nonmultilinear tests. A similar example can 
be constructed for the 2-d convex hull problem. 

Nevertheless, many standard test functions commonly found in geometric algorithms are multilinear. For 
example, in 3-d, the predicate ABOVE(p 1; ... ,p^) which returns true if and only if pi is above the plane 
through p- 2 ,P 3 ,P 4 can be reduced to testing the sign of a multilinear function (a determinant). 

To see the versatility of multilinear tests, consider the following extended definition: we say that a 
function / : (R d ) c — > R d is quasi-multilinear if /(pi,... ,p c ) = (/i(pi, • • ■ ,p c ), • • ■, fd(pi, ■ ■ ■ ,Pc)) where 
fi = hi(pi,... ,Pc)/g(pi, ■ ■ ■ ,Pc) in which fi, ■ ■ ■, fd,g '■ (R d ) c —> M are multilinear functions. In 3-d, the 
function PLANE(pi,p 2 ,p 3 ) which returns the dual of the plane through pi,p 2 ,P 3 is quasi-multilinear; sim¬ 
ilarly the function lNTERSECT(pi,p 2 ,p 3 ) which returns the intersection of the dual planes of pi,P 2 ,P 3 is 
quasi-multilinear. This can be seen by expressing the answer as a ratio of determinants. 

More generally, we have the following rules: 


— if the function /* : (R 3 ) Ci -a- R 3 is quasi-multilinear for each i £ 

{1,2,3}, then PLANE(/i(pn,... ,pi Cl ), / 2 (p2i, • • • ,p 2c2 ), hipsi, ■ ■ ■ ,PSc 3 )) and 

INTERSECt(/i(ph, ... ,P 1 C1 ), / 2 (P 21 , • • ■ ,P 2 c 2 ), f^ipzi, ■ ■ ■ ,P3c ? )) are quasi-multilinear; 

— if the function fi : (R 3 ) Ci —> R 3 is quasi-multilinear for each i £ {1,...,4}, then 

ABOVe(/i( pn,... ,Pici), ■ • ■, / 4 (p 4 i,...,P4 c 4 )) can be reduced to testing the sign of a multilinear func¬ 
tion. 


By combining the above rules, more and more elaborate predicates can thus be reduced to testing the 
signs of multilinear functions, such as in the following example: 


/ Pio, \ 

Pn, 

P12, 


V 


ABOVE 


INTERSECT 


PLANE(pi,p 2 ,p 3 ), \ 
PLANE(p4,p 5 ,p 6 ), 
PLANE (p 7 , pg, pg) ) j 













However, we may run into problems if a point occurs more than once in the expression, as in the following 
example: 


ABOVE 


PlO, 

Pu, 

Pi, 

INTERSECT 


PLANE(pi, P 2 , P 3 ), 
PLANE (p 4 , P 5 , Pq ), 
PLANE(p 7 ,p 8 ,p 9 ) 



Here, the expansion of the determinants may yield monomials of the wrong type. In most 2-d algorithms, 
this kind of tests does not arise or can be trivially eliminated. Unfortunately, they can occasionally occur in 
some 3-d algorithms, including our 3-d upper hull algorithm in Section 3.3. We now describe how to modify 
our algorithm to avoid these problematic tests. 

First, we consider the partition construction in Lemma 3.3. We choose the more elementary alternative 
based on the 8 -sectioning theorem: there exist 3 planes that divide space into 8 regions, each with n/8 points 
of Q. By perturbing the 3 planes one by one, we can ensure that each of the 3 planes passes through 3 
input points, and that the resulting 9 points are distinct, while changing the number of points of Q in each 
region by ±0(1). A brute-force algorithm can find 3 such planes in polynomial time. We can reduce the 
construction time by using the standard tool called epsilon-approximations [Matousek 2000]: we compute a 
^-approximation of Q in linear time for a sufficiently small constant 5 > 0, and then apply the polynomial 
algorithm to the constant-sized ^-approximation. This only changes the fraction 1/8 by a small term ±0(<5). 
It can be checked that known algorithms for epsilon-approximations [Matousek 2000] require only multilinear 
tests (it suffices to check the implementation of the so-called subsystem oracle , which only requires the ABOVE 
predicate). We remove the 9 defining points before recursively proceeding inside the 8 regions. As a result, 
we can ensure that the facets in each convex polyhedral cell are all defined by planes that pass through 3 
input points, where no two planes share a common defining point. A vertex v of a cell is an intersection of 
3 such planes and is defined by a set of 9 distinct input points, denoted DEf(i>). 

Next, we consider the proof of Lemma 3.8 for answering r linear programming queries. We choose the 
alternative approach by [Chan 1996a] based on a partition construction, which we have from the previous 
paragraph. This algorithm is based on a deterministic version of the sampling-based linear programming 
algorithm by [Clarkson 1995]. The algorithm can also support up to r insertions and deletions of halfspaces 
intermixed with the query sequence. The algorithm can be implemented with simple predicates such as 
ABOVE. 

Now, in the algorithm hull3d, we make one change: in line 5, we prune only when each vertex v of 7 * 
lies strictly below the upper hull of Q — DEf(v) (instead of the upper hull of Q). In the dual, testing such a 
vertex v reduces to a linear programming query after deletion of DEF(t>) from Q , where the coefficient vector 
of the objective function is quasi-multilinear in def(u). Since DEf(v) has been deleted from Q , we avoid the 
problem of test functions where some point appears more than once in the expression. It can be checked that 
applying the algorithm for linear programming queries from the previous paragraph indeed requires only 
multilinear tests now. 

Since the pruning condition has been weakened, the analysis of hull3d needs to be changed. Recall that 
the partition in line 3 is constructed by recursive application of the 8 -sectioning theorem. At half the depth 
of recursion, we obtain an intermediate partition of Q with subsets Q' e and corresponding cells 7 / 

where each subset has 0(n/y/rj) points, and every plane intersects at most 0(^/rJ 1 ~ e ) of these cells 7 ^, for 
e = 1 — log 8 7. Furthermore, for a fixed j' e , every plane intersects at most 0( y /rj 1 ~ e ) of the cells 7 * of the 
final partition inside 7 ^. 

In the second paragraph of the proof of Theorem 3.9, we do the analysis differently. Consider a cell 7 * of 
the partition, which is contained in a cell 7 / of the intermediate partition. We claim that if (i) 7 ^ is strictly 
contained in and (ii) 7 , is strictly contained in 7 ^, then all points inside 7 j are pruned. To see this, notice 
that by (i), all points in Q' e are strictly below the upper hull of Q, and by (ii), the defining points def(u) of 
any vertex v of 7 » are in Q’ t Thus, the points in def(i>) are strictly below the upper hull of Q, i.e., the upper 
hull of Q — def(v) is the same as the upper hull of Q. As each vertex v of 7 is strictly below the upper hull 
of Q — def(u), all points inside 7 are indeed pruned. 

At most 0(^rj 1 ~ £ ) cells 7 ^ can intersect the boundary of A*,. For each of the 0(y/rj) cells 7 ^ 
strictly contained in A*,, at most 0{ y /rj 1 ~ e \ogrj) cells 7 * inside 7 / can intersect the O(logrj) bound¬ 
ary facets of 7 ^. Hence, the number of points in Sk that remain in Q after iteration j is at most 



min {IS'fcl, 0{ y Jrj 1 e • n/y/rj + ^/rj ■ (^/rj 1 s log rj) ■ n/rf)} = min ||5 fe |, 0((n/r s / 2 ) logr^)}. The rest of 
the proof is then the same, after readjusting e by about a half. 

6. OTHER APPLICATIONS 

We can apply our techniques to obtain instance-optimal algorithms for a number of geometric problems in 
the order-oblivious and random-order setting: 

(1) Off-line halfspace range reporting in 2-d and 3-d : given a set S of n points and halfspaces, report the 
subset of points inside each halfspace. Algorithms with 0(nlog?i + iv) running time [Cliazelle et al. 1985; 
Chan 2000; Afshani and Chan 2009; Chan and Tsakalidas 2015] are known for total output size K. 

(2) Off-line dominance reporting in 2-d and 3-d: given a set S of n red/blue points, report the subset of red 
points dominated by each blue point. The problem has similar complexity as (1). 

(3) Orthogonal segment intersection in 2-d: given a set S of n horizontal/vertical line segments, report all 
intersections between the horizontal and vertical segments, or count the number of such intersections. 
The problem is known to have worst-case complexity @(nlogn + K) in the reporting version for output 
size K, and complexity 0(nlogn) in the counting version [de Berg et al. 1997; Preparata and Shamos 
1985]. 

(4) Bichromatic Loo-close pairs in 2-d: given a set S of n red/blue points in 2-d, report all pairs (p, q ) where 
p is red, q is blue, and p and q have Loo-distance at most 1, or count the number of such pairs. Standard 
techniques in computational geometry [de Berg et al. 1997; Preparata and Shamos 1985] yield algorithms 
with the same complexity as in (3). 

(5) Off-line orthogonal range searching in 2-d: given a set S of n points and axis-aligned rectangles, report 
the subset of points inside each rectangle, or count the number of such points inside each rectangle. The 
worst-case complexity is the same as in (3). 

( 6 ) Off-line point location in 2-d: given a set S of n points and a planar connected polygonal subdivision of 
size O(n), report the face in the subdivision containing each point. Standard data structures [de Berg 
et al. 1997; Preparata and Shamos 1985; Snoeyink 1997] imply a worst-case running time of 0(?rlogn). 

For each of the above problems, it is not difficult to see that certain input sets are indeed “easier” than 
others, for example, if the horizontal segments and the vertical segments respectively lie inside two bounding 
boxes that are disjoint, then the orthogonal segment intersection problem can be solved in 0(n) time. 

Note that although some of the above problems may be reducible to others in terms of worst-case complex¬ 
ity, the reductions may not make sense in the instance-optimality setting. For example, an instance-optimal 
algorithm for a problem does not imply an instance-optimal algorithm for a restriction of the problem in a 
subdomain, because in the latter case, we are competing against algorithms that have to be correct only for 
input from this subdomain. 

6.1. A general framework for reporting problems 

We describe our techniques for off-line reporting problems in a general framework. Let 7Z C K. d x be a 
relation for some constant dimensions d and d'. We say that a red point p £ and a blue point q £ 
interact if (p, q) £ TZ. We consider the reporting problem: given a set S containing red points in and blue 
points in of total size n, report all K interacting red/blue pairs of points in S. (By scanning the output 
pairs, we can then collect the subset of all blue points that interact with each red point, in 0{K) additional 
time.) 

We redefine H(S) as follows: 

Definition 6.1. Given a cell 7 colored red (resp. blue), we say that 7 is safe for S if every red (resp. blue) 
point in 7 interacts with exactly the same subset of blue (resp. red) points in S. We say that a partition II 
of S is respectful if each subset Sk in II is a singleton, or a subset of red points enclosed by a safe red simplex 
Afc for S, or a subset of blue points enclosed by a safe blue simplex A*, for S. Define the structural entropy 
H{S) of S to be the minimum of "H(II) = 'f2 k (\Sk\/n) log(n/|S'/ c |) over all respectful partitions II of S. 

Theorem 6.2. OPT(S), OPT avg (5) e f + 1) + K) for the reporting problem in the multilinear 
decision tree model. 

Proof. This follows from a straightforward modification of the proofs of Theorems 3.5 and 4.2. The 
main difference is that we now keep two partition trees, one for the red (resp. blue) points in S, with cells 
colored red (resp. blue). If a cell r(u) is safe for S, or if the number of red (resp. blue) points in r(u) drops 



below a constant, then we make v a leaf in the red (resp. blue) partition tree. At the end, we argue that v p 
must be a leaf for every red (resp. blue) point p € S. Otherwise, the red (resp. blue) cell r(u p ) contains at 
least two red (resp. blue) points and is not safe, so we can move p to another point inside r(u p ) and change 
the answer. The algorithm would be incorrect on the modified input. (The f l(K) term in the lower bound is 
obvious.) □ 

For the upper-bound side, we assume the availability of three oracles concerning 1Z , where a is some 
positive constant: 

(A) A worst-case algorithm for the reporting problem that runs in 0(?rlog?i + K) time. 

(B) A data structure with 0(n\ogn) preprocessing time, such that we can report all k blue (resp. red) points 
in S interacting with a query red (resp. blue) point in 0(n 1- “ + k) time. 

(C) A data structure with 0(n log n) preprocessing time, such that we can test whether a query red or blue 
convex polyhedral cell 7 of size a is safe for S in 0(an 1_ “) time. 

Note that we can reduce to preprocessing time in (B) to 0((n/m) ■ m log m) = 0(n log m ) while increasing 
the query time to 0((n/m)-m 1 ~°‘+K) for any given 1 < m < n. This follows from the grouping trick by [Chan 
1996c]: namely, divide S into \n/m] subsets of size 0(m) and build a data structure for each subset. By setting 
m = r 1 /", we can then answer r queries in total time 0(n log m+r-{n/m)-m}~ a -\-k) C 0[n log r+r 0 ^ +n) for 
total output size k. Similarly, in (C), we can answer r queries in total time 0(n log r + ar°^). The grouping 
trick is applicable because the query problems in (B) and (C) are decomposable, i.e., the answer of a query 
for a union of subsets can be obtained from the answers of the queries for the subsets. 

We now solve the reporting problem by a variant of the hull3d algorithm in Section 3.3: 

report(Q): 

1 . for j = 0, 1 ,..., [log((51ogn)J do 

2 . partition the red points in Q by Lemma 3.3 to get rj = 2 2J subsets Q 1 ,..., Q, , 

and red cells 71 ,..., . 

3. for each * = 1 to r :] do 

4. if 7 i is safe for Q then 

5. let Zi be the subset of blue points in Q that interact with 
an arbitrary red point in Qi 

6 . output Qi x Zi 

7. prune all red points in Qi from Q 

8 . redo lines 2 7 with “red” and “blue” reversed 

9. solve the reporting problem for the remaining set Q directly 

The test in line 4 for each convex polyhedral cell 7 , of size at most Ollogrj) can be done by querying the 
data structure in (C), and line 6 can be done by querying the data structure in (B); the cost of 0(r ? ) queries 
is 0(\Q\ log rj + rf^) plus the output size. Line 9 can be done by the algorithm in (A). 

Theorem 6.3. Given oracles (A), (B), and (C), algorithm report(5) runs in 0(n('H(S) + 1) + K) time. 
Proof. The analysis is as in the proof of Theorem 3.9. □ 

The partition construction in line 2 can be done in the multilinear model, as described in Section 5. 
Whether the rest of the algorithm works in the multilinear model depends on the implementation of the 
oracles. 

For orthogonal-type problems dealing with axis-aligned objects, such as problems (2)-(5) in our list, we 
can work instead in the comparison model. We just replace simplices with axis-aligned boxes in the definition 
of 7i(S), replace convex polyhedral cells with axis-aligned boxes in oracle (C), and replace partition trees 
with k -d trees in both the lower-bound proof and the algorithm. 

We can immediately apply our framework to the reporting versions of problems (1)—(5), after checking the 
oracle requirements for (B) and (C) in each case: 

(1) Off-line halfspace range reporting in 2-d and 3-d : For the design of the needed data structures, it suffices 
to consider just the lower halfspaces in the input. Color the given points red, and map the given lower 
halfspaces to blue points by duality. The data structure problem in (B) is just halfspace range reporting. 
The data structure problem in (C) is equivalent to testing whether any of the 0(a) edges of a query convex 



polyhedral cell intersects a given set of n hyperplanes (lines in 2-d or planes in 3-d). This reduces to 
simplex range searching [Agarwal and Erickson 1999; Matousek 1992] by duality; known results achieve 
0{n log n) preprocessing time and close to 0{an 1 ~ 1 ^ d ) query time. It can be checked that the entire 
algorithm is implementable in the multilinear model, at least using a simpler randomized algorithm for 
(A) [Chan 2000], 

(2) Off-line dominance reporting in 2-d and 3-d : The data structure problem in (B) is just dominance 
reporting. The data structure problem in (C) is equivalent to testing whether all the corners of a query 
box are dominated by the same number of points from a given n-point set. This reduces to orthogonal 
range counting [Agarwal and Erickson 1999; de Berg et al. 1997; Preparata and Shamos 1985]; although 
better data structures are known, fc-d trees are sufficient for our purposes, with O(nlogn) preprocessing 
time and 0(n 1_1 / d ) query time. The entire algorithm works in the comparison model. 

( 3 ) Orthogonal segment intersection in 2-d: Map each each horizontal line segment (x, y){x', y) to a red 
point (x,x',y) £ R 3 and each vertical line segment (£,?7)(£,?/) to a blue point (£,77,77') £ R 3 . Each 
point in R 3 is the image of a horizontal/vertical line segment. The data structure problem in (B) for red 
queries corresponds to reporting all points from a given n-point set that lie in a query range of the form 
{(£,77,7/) £ R 3 : ((x < £ < x') V ( x ' < £ < 2 :)) A ((77 < y < if) V (77' < y < 77))} for some x,x',y. This 
reduces to 3 -d orthogonal range reporting. The data structure problem in (C) for red queries corresponds 
to testing whether a query box in R 3 intersects any of the boundaries of n given ranges, where each 
range is of the form {( 2 :, x ', y) £ R 3 : ((x < £ < x') V {x' < £ < 2 ;)) A ((77 < y < if) V {if <y< 77))} for 
some £,77,77'. This is an instance of 3 -d orthogonal intersection searching [Agarwal and Erickson 1999 ], 
which reduces to orthogonal range searching in a higher dimension. Again fc-d trees are sufficient for our 
purposes. Blue queries are symmetric. The entire algorithm works in the comparison model. 

(4) Bichromatic Loo-close pairs in 2-d: The problem in (B) corresponds to reporting all points of a given 
point set that lie inside a query square of side length 2. This is an instance of orthogonal range reporting. 
The problem in (C) corresponds to testing whether a query box intersects any of the edges of n given 
squares of side length 2. This is an instance of orthogonal intersection searching. Note that here the 
resulting algorithm requires a slight extension of the comparison model, to include tests of the form 
Xi < x'j + a mentioned in Remark 2.5, which are allowed in the lower-bound proof. 

(5) Off-line orthogonal range reporting in 2-d: Color the given points red, and map each rectangle [£,£'] x 
[77,77'] to a blue point (£, £', 77,77') £ R 4 . Every point in R 4 is the image of a rectangle. The problem in (B) 
for red queries corresponds to 2 -d rectangle stabbing, i.e., reporting all rectangles, from a given set of n 
rectangles, that contain a query point. The problem in (B) for blue queries corresponds to 2-d orthogonal 
range reporting. The problem in (C) for red queries corresponds to deciding whether a query box in R 2 
intersects any of the edges of n given rectangles. The problem in (C) for blue queries corresponds to 
deciding whether a query box in M 4 intersects any of the boundaries of n given ranges, where each range 
is of the form {(£, £', 77, if) £ R 4 : ((£ < x < £') V (£' < x < £)) A ((77 < y < 77') V {if <y< 77))} for some 
x , y. All these data structure problems reduce to orthogonal range or intersection searching. Again the 
algorithm works in the comparison model. 

6.2. Counting problems 

Our framework can also be applied to counting problems , where we simply want the total number of interact¬ 
ing red/blue pairs. We just change oracle (A) to a counting algorithm without the 0{K) term, and oracle (B) 
to counting data structures without the 0{n) term. In line 5 of the algorithm we compute \Z\, and in line 6 we 
add |Q,:| x \Z\ to a global counter. The same lower- and upper-bound proofs yield an Q{n{'H{S) + 1)) bound. 
The new oracle requirements are satisfied for (3) orthogonal segment intersection counting, (4) bichromatic 
Loo-close pairs, and (5) off-line orthogonal range counting. 

We can also modify the algorithm to return individual counts, i.e., compute the number of red points that 
interact with each blue point and the number of blue points that interact with each red point. Here, we need 
to not only strengthen oracle (A) to produce individual counts, but also modify oracle (B) to the following: 

(B) A data structure with O(nlogn) preprocessing time, forming a collection of canonical subsets of total 
size 0 (? 7 log?r), such that we can express the subset of all blue (resp. red) points in S interacting with a 
query red (resp. blue) point, as a union of 0 (?r 1_ “) canonical subsets, in 0{ 7i 1-Q ) time. 

As before, the grouping trick can be used to reduce the preprocessing time and total size of the canonical 
subsets. In line 4 of the algorithm we express Z as a union of canonical subsets. In line 5 we add \Z\ to the 
counter of each red point in Q, and add |Q,| to the counter of each canonical subset for Z. At the end of 



the loop in lines 3-7, we make a pass over each canonical subset and add its counter value to the counters 
of its blue points, before resetting the counter of the canonical subset. Line 8 is similar. The analysis of the 
running time remains the same. The strengthened oracle requirements are satisfied for problems (3), (4), 
and (5) by known orthogonal range searching results. 

6.3. Detection problems? 

We can also consider detection problems where we simply want to decide whether there exists an interacting 
red/blue pair. Here, we redefine H(S) by redefining “safe”: a red (resp. blue) cell 7 is now considered safe 
for S if no red (resp. blue) point in 7 interacts with any blue (resp. red) points in S. We change oracles (A) 
and (B) to analogous detection algorithms and data structures, without the O(K) and 0{k) terms. 

The proof of the upper bound 0{n(fH(S) +1)) is the same, but unfortunately the proof of the lower bound 
fl(n(fH(S) + 1)) only works for instances with a NO answer: at the end, if v p is not a leaf for some red (resp. 
blue) point p £ S, then T(u p ) contains at least two red (resp. blue) points and is not safe, so we can move p 
to some point inside T(u p ) and change the answer from NO to yes. 

yes instances are problematic, but this is not a weakness of our technique but of the model: on every input 
set S with a yes answer, OPT(S') is in fact 0(n). To see this, consider an input set S for which there exists 
an interacting pair (p,q). An algorithm that is “hardwired” with the ranks of p and q in S with respect to, 
say, the assorted order of S can first find p and q from their ranks by linear-time selection, verify that p 
and q interact in constant time, and return yes if true or run a brute-force algorithm otherwise. Then on 
every permutation of this particular set S, the algorithm always takes linear time. Many problems admit 
f2(?rlog?i) worst-case lower bounds even when restricted to YES instances, and for such problems, instance 
optimality in the order-oblivious setting is therefore not possible on all instances. 

6.4. Another general framework for off-line querying problems 

We now study problems from another general framework. Let A4 be a mapping from points in to “answers” 
in some space for some constant d (the answer A i(q) of a point q £ may or may not have constant size 
depending on the context). We consider the following off-line querying problem: given a set S of n points in 
M d , compute A i(q) for every q £ S. Let K denote the total size of the answers. 

We redefine H(S) by redefining “safe”: 

Definition 6.4. We say that a cell 7 is safe if every point q in 7 has the same answer M.{q). We say that 
a partition n of S is respectful if each subset Sk in n is a singleton, or a subset of points enclosed by a safe 
simplex A*,. Define the structural entropy H(S) of S to be the minimum of "H(n) = X]fc(l'Sfcl/ n ) log(n/|Sfc|) 
over all respectful partitions n of S. 

Theorem 6.5. OPT(S), OPT avg (5') £ f2(n(%(S) + 1) + K) for the off-line querying problem in the 
multilinear decision tree model. 

Proof. This follows from a straightforward modification of the proofs of Theorems 3.5 and 4.2. As before, 
if a cell T(v) is safe, then we make v a leaf. At the end, we argue that v p must be a leaf for every p £ S. 
Otherwise, T(u p ) is not safe, so we can move p to another point inside T(u p ) and change the answer. The 
algorithm would be incorrect on the modified input. □ 

The above lower bound holds even if we ignore the cost of preprocessing A4. Furthermore, the test functions 
are only required to be multilinear with respect to S, not AL 

For the upper-bound side, we assume that A4 has been preprocessed in an oracle data structure supporting 
the following types of queries: 

(A) Given q £ R d , we can compute A 4(q) in 0(logm + k) worst-case time for output size k, where m is a 
parameter describing the size of AL 

(C) Given a convex polyhedral cell 7 of size a, we can test whether 7 is safe in 0(am 1 ^ a ) time. 

The algorithm is simpler this time. Instead of using a 2 2J progression, we can use a more straightforward 
b- way recursion, for a sufficiently large constant b (the resulting recursion tree mimics the tree T from the 
lower-bound proof in Theorem 3.5, on purpose): 



off-line-queries(Q, F), where Q C T: 

1 . if |Q | drops below n/m s then compute the answers directly and return 

2 . partition Q by Lemma 3.3 to get b subsets Qi, ■ ■ ■ ,Qb and cells 71,...,75 

3. for i = 1 to b do 

4. if 7 , fl r is safe then 

5. compute A i(q) for an arbitrary point q £ 7 * (~l T 

6 . output A i(q) as the answer for all points in Qi 

7. else off-line-queries(Qi, 7 * nr) 

We call off-line-queries^,!^) to start. Line 1 takes 0(|Q| log jti+k) time for output size n by querying 
the data structure for (A); note that each point in Q in this case has participated in f2(log?n) levels of 
the recursion, and we can account for the first term by charging each point unit cost for every level it 
participates in. Line 2 takes 0(|Q|) time for a constant b by known constructions of Matousek’s partition 
theorem [Matousek 1992] (or alternatively recursive application of the 4- or 8 -sectioning theorem in the 2-d 
or 3-d case). The test in line 4 takes 0(m 1_a polylogm) time by querying the data structure for (C), since 
the convex polyhedral cell 7 i (~l T has at most O(logm) facets (and thus O(polylogm) size). As the tree has 
0(m s ) nodes, the cost of line 4 is negligible, since its total over the entire recursion tree is sublinear in m by 
choosing a sufficiently small constant 6 < a. Line 5 takes 0(logm + k) time for output size k, by (A); the 
O(logm) term is again negligible, since its total over the entire recursion tree is sublinear in m. 

Theorem 6 . 6 . After Ai has been preprocessed for (A) and (C), algorithm off-line-queries^, R d ) 
runs in 0(n(7~L(S) + 1) + I\) + o{m) time for total output size K. 

Proof. Let Uj be number of points in S that survive level j. i.e., participate in subsets Q at level j of 
the recursion. The total running time for the off-line problem is asymptotically bounded by y~b nj , ignoring 
a o(m) extra term. 

Let II be any respectful partition of S. Consider a subset Sk in II. Let A* be a safe simplex enclosing Sk- 
Fix a level j. Let Qfs and ji’s be the subsets Q and cells 7 at level j. Each Qi has size at most n/Q(bft. 
The number of y^’s that intersect the boundary of A*, is at most Thus, the number of points in 

Sk that survive level j is at most min {|Sfc|, 0(6 1—e )- 7 • n/0(bftj. Since the Sfts cover the entire point set, 
with a double summation we have, for a sufficiently large constant b, 

^ min {I'S'fehn/e^} 

3 i fc 

= ^^mm{|S fc |,n/ 0 (&) ej } 

k j 

C £0(|S fc |(log(n/|S fc |) + l)) 
k 

= 0(n(H( n) + 1)), 

which yields the theorem. □ 

For orthogonal-type problems, we can work instead in the comparison model. We just replace simplices 
with axis-aligned boxes in the definition of H(S), replace convex polyhedral cells with axis-aligned boxes in 
oracle (C), and replace partition trees with fc-d trees in both the lower-bound proof and the algorithm. 

We can apply our framework to solve problem ( 6 ): 

( 6 ) Off-line point location in 2-d: For (A), data structures for planar point location with 0(log?n) worst- 
case query time are known, with 0(m) preprocessing time and space [Kirkpatrick 1983; Chazcllc 1991; 
Snoeyink 1997]. The data structure problem in (C) is equivalent to testing whether any of the 0(a ) edges 
of a query polygon intersect the given polygonal subdivision. This reduces to ray shooting (or segment 
emptiness) queries in the subdivision, for which known results [Chazelle et al. 1994] achieve 0(log?n) 
query time, with O(m) preprocessing time and space. For this problem, each answer has constant size, so 
the O(K) and O(k) terms can be omitted. The total running time is 0(n('H(S) + 1)), even if preprocessing 
time is included, for a subdivision of size m = 0(n). It can be checked that the entire algorithm works 
in the multilinear model. 



6.5. On-line querying problems and distribution-sensitive data structures 

In the general framework of the preceding section, we can also consider the following on-line querying 
problem: given a set S of n points in R d , build a data structure so that we can compute M(q) for any query 
point q G R d , while trying to minimize the average query cost over all q € S. 

Our off-line lower bound states that the total time required to answer queries for all n points in S is 
fl(n('H(S) + 1)). This immediately implies that the average query time over all q G S must be + 1). 

(In contrast, lower bounds for the on-line problem do not necessarily translate to lower bounds for the off-line 
problem.) 

On the other hand, our algorithm for off-line queries can be easily modified to give a data struc¬ 
ture for on-line queries. We just build a data structure corresponding to the recursion tree generated by 
off-line-queries)^,R d ), in addition to the data structure for (A) and (C). It can be shown that with such 
a data structure, the average query time over all q G S is 0('H(S) + 1) (more details are given below). 

We can extend the on-line querying problem to the setting where each point in S is weighted and the 
goal is to bound the weighted average query time over the query points in S. Even more generally, we can 
consider the setting where S is replaced by a (possibly continuous) probability distribution and the goal is 
to bound the expected query time for a query point randomly chosen from S. We now provide more details 
for the changes needed in this most general setting. 

We first redefine 'H(S') for a probability distribution S. 

Definition 6.7. We say that a cell 7 is safe if every point q in 7 has the same answer A4 (</). A partition II of 
R d into regions is respectful if each region Sk of II can be enclosed by a safe simplex A k • Define the structural 
entropy TL(S) of a probability distribution S to be the minimum of 77(11) = ^2 k ps(Sk) ^og(l/ps{Sk)) over 
all respectful partition II of R d , where /is denotes the probability measure corresponding to S. 

We need a continuous version of Lemma 3.3, which follows by straightforward modification to the proof 
of the partition theorem [Matousek 1992] (or alternatively, recursive application of the 4- or 8-sectioning 
theorem in the 2-d or 3-d case). 

Lemma 6 .8. For any probability measure in R d and 1 < r < n for any constant d, we can partition R d 
into r (not necessarily convex or connected) polyhedral regions Qi, ■ - ■ ,Q r each with measure @(l/r) and 
with r 0 ^ (or fewer) facets, and find r convex polyhedral cells 71 ,... , 7 r each with O(logr) (or fewer) facets, 
such that Qi is contained in 7 i, and every hyperplane intersects at most O^r 1 ^ 6 ) cells. Here, e > 0 is a 
constant that depends only on d. 

The lower bound proof is easier for on-line problems, so we present the simplified proof in full below. 
Because the input to a query algorithm is just a single point (unlike in the off-line setting where we could 
perform a test that has more than one query points as arguments), multilinear tests can now be replaced 
with linear tests. 

Theorem 6.9. Any algorithm for the on-line querying problem requires + 1 + k) expected query 

time for output size n, for any probability distribution S in the linear decision tree model. 

Proof. We define a partition tree T as follows: Each node v stores a pair (Q(u), T(i>)), where Q(v) is a 
region enclosed inside a convex polyhedral cell T(u). The root stores (R d , R d ). If ps{Q(,v)) drops below l/m s 
or T(v) is safe, then v is a leaf. Otherwise, apply Lemma 3.3 with r = b to the restriction of ps to Q(v), and 
obtain regions Qi, ■ ■ ■ ,Qb and cells 71 ,..., 7 b . For the children iq,..., Vb of v, set Q(vi) = Qi 0 Q(v) and 
T(uj) = 7 * nT(u). For a node v at depth j of the tree T we then have ps(Q(v)) > I/O (by, and consequently, 
the depth j is in fl(\og b {l/p s {Q(v)))). 

Let n part . tree be the partition formed by the cells T(u) at the leaves v in T. Let n part -tree be a refinement 
of this partition after triangulating the leaf cells. Note that the subpartitioning of a leaf cell at depth j 
causes the entropy to increase by at most 0(ps(Q(v)) log(j log6)) C 0(ps{Q{v)) loglog(l//xs(Q(u)))) for 

any constant b. So 77 ( I I part-tree) G O (77 ( H par t-tree)) ■ dearly, Upart-tree is respectful. 

The adversary constructs a bad query point q as follows. During the simulation, we maintain a node v q in 
7 ”, where the only information the algorithm knows about q is that q lies inside T(u 9 ). 

Suppose that the simulation encounters a test to determine which side q lies inside a hyperplane h. We do 
the following: 

— We reset v q to one of its children at random, where each child v' q is chosen with probability ps{Q' q )/HsiQq) 

(which is in 0(1/6)). If T^ does not intersect h, then the comparison is resolved. Otherwise, we repeat. 



The probability that the comparison is not resolved in a single step is at most 0(6 1_£ ) • 0(1/6), which can 
be made less than 1/2 for a sufficiently large constant b. 

Let T be the number of tests made. Let D be the depth of v q at the end. For i < T. let T) = 1 and Di 
be the number of steps needed to resolve the ?'-th test; then E [Di] < 2. For i > T, let T* = Di = 0. Since 
E[2Tj — Di] > 0 for all i, by linearity of expectation E/D] = E[^A Df] < 2E[JT T)] = 2E[T]. 

At the end of the algorithm, v q must be a leaf. Thus, 

E[T] > D(E [D]) 

Cfi E n s {Q{v))\og{l/n s {Q{v))) j 

\leaf v / 

c ft(w(n part -tree)) 
c si(H( n part -tree)) 

C DCH(S)). 

At the end, we can assign q to a random point in Q(v q ) chosen from the distribution S. Then q satisfies 
precisely the probability distribution S (in other words, the adversary have not acted adversarily after all). 
Thus, E[T] is the expected query time for a query point randomly chosen from the distribution S. □ 

For the upper-bound side, below is the pseudocode for the preprocessing algorithm and the query algorithm, 
which are derived from our previous algorithm off-line-queries; here, b is a sufficiently large constant. 
The query algorithm assumes an oracle data structure for (A) (oracle (C) is only needed in the preprocessing 
algorithm). 


preprocess(Q,T): 

1 . if ns(Q) < 1 /m s then return 

2. apply Lemma 6.8 to the restriction of yis to Q 
to get b regions Qi, ■ • • ,Qb and cells 7ii ■ • •,76 

3 . for * = 1 to b do 

4. if 7 i fl T is safe then store M{q) for an arbitrary point q € 7 * (~1 T 

5. else preprocess(<5j fl Q, 7 i fl T) 


on-line-query(q, Q , T): 

1 . if ns(Q) < 1 /m s then compute A i(q) directly and return 

2 . locate the region Qi containing q 

3. if 7 j (~l T was marked as safe then look up the stored answer and return 

4. else on-line-query^, Qi D Q, 7 * D T) 

The space of the tree generated by preprocess(R d ,R d ) is only 0(m s ), and so the space for the data 
structure for (A) dominates. We will not focus on the preprocessing time, which depends on the construction 
time for Lemma 6 . 8 , which in turn depends on the distribution S. The preprocessing can be done efficiently, 
for example, for a discrete ?r-point distribution and for many other distributions. 

In algorithm on-line-query, line 1 takes 0(log?n) time by (A); note that this case occurs only when the 
query point q participates in D(log?n) levels of the recursion, and we can account for the cost by charging 
one unit to each level of the recursion. Line 2 takes 0(1) time for a constant b. We can adapt the previous 
proof of Theorem 6.6 for the rest of the query time analysis: 

Theorem 6.10. After A4 has been preprocessed for (A) and preprocess(R d ,R d ) has been executed, al¬ 
gorithm on-line-query^, R d , R d ) runs in 0(fH{S) +1 + k) expected time for output size n, for a query point 
q randomly chosen from the distribution S. 

Proof. Let rij be 1 if the query point participates at level j of the recursion, and 0 otherwise. Then the 
query time is asymptotically bounded by ]>// Uj. 

Let II be any respectful partition of S. Consider a region Sk in II, enclosed in a safe simplex A*,. Fix a level 
j. Let Qfs and r,;’s be the regions and cells at level / of the recursion tree. Each Qi satisfies ns(Qi) < 1 /0(6)A 
The number of Tj’s that intersect the boundary of A*, is at most 0(6 1_e ) J . Thus, Pr \(rij = 1) A (q £ Sk)] < 



mm{jis{Sk)- Oib 1 e ) J • 1/0(&) J } for a point q randomly chosen from the distribution S. Since the Sk s cover 
R d , with a double summation we have, for a sufficiently large constant b , 


E 


E 


3 


which yields the theorem. □ 


< £]Tmin{/z s (S fc ), 1/0(6)^} 

= EEminl^ 5 ^)’ 1 / 0 ^)^} 

fc j 

C X)0(/is(S fc )(log(l/Ais(Sfc)) + l)) 

k 

= 0{U{ n) + i), 


For orthogonal-type problems, we can again work in the comparison model, by replacing simplicial and 
convex polyhedral cells with axis-aligned boxes. 

We can apply our framework to on-line versions of several problems: 

(6) On-line point location queries in 2-d: We immediately obtain optimal 0(1~L(S) + 1) expected query cost, 
with an 0(m)-space data structure for a subdivision of size m, for any given distribution S. The query 
algorithm works in the linear decision tree model. This on-line point location result is known before 
[Arya et al. 2007a; Arya et al. 2007b; Collette et al. 2012; Iacono 2004] (some of these previous work 
even optimize the constant factor in the query cost). 

(1) On-line halfspace range reporting queries in 2-d and 3-d: Here, we map query lower halfspaces to points 
by duality. For (A), data structures for 2-d and 3-d halfspace range reporting with 0(logm + re) worst- 
case time are known, with 0(m) space [Chazelle et al. 1985; Afshani and Chan 2009]. We thus obtain 
optimal 0(H(S) + 1 + re) expected query cost for output size re, with an 0(m)-space data structure for 
a given 2-d or 3-d m-point set, for any given distribution S. The query algorithm works in the linear 
decision tree model. This result is new. 

(2) On-line dominance reporting queries in 2-d and 3-d: The story is similar to halfspace range reporting. 
The query algorithm now works in the comparison model. 

(4) On-line orthogonal range reporting/counting queries in 2-d: Here, we map query rectangles to points 
in 4-d as in Section 6.1. For (A), data structures for 2-d orthogonal range reporting with 0(logm + re) 
worst-case query time are known, with 0(mlog e m) space; and data structures for 2-d orthogonal range 
counting with O(logm) worst-case query time are known, with 0(m) space [Chazelle 1988]. For reporting, 
we thus obtain optimal 0('H(S) + 1 + k) expected query cost for output size re, with an 0(m log £ m)-space 
data structure for a given 2-d m-point set, for any given distribution S; for counting, we get optimal 
0(H(S) + 1) expected query cost with an 0(m)-space data structure. The query algorithm works in the 
comparison model. This result is apparently new, as it extends Dujmovic, Howat, and Morin’s result 
on 2-d dominance counting [Dujmovic et al. 2012] and unintentionally answers one of their main open 
problems (and at the same time improves their space bound from O(mlogm) to 0(m)). 

Remark 6.11. Some months after the appearance of the conference version of the present paper, a similar 
general technique for distribution-sensitive data structures was rediscovered by [Bose et al. 2010]. 


7. DISCUSSION 

Although we have argued for the order-oblivious form of instance optimality, we are not denigrating adaptive 
algorithms that exploit the order of the input. Indeed, for some geometric applications, the input order may 
exhibit some sort of locality of reference which can speed up algorithms. There are various parameters that 
one can define to address this issue, but it is unclear how a unified theory of instance optimality can be 
developed for order-dependent algorithms. 

We do not claim that the algorithms described here are the best in practice, because of possibly larger 
constant factors (especially those that use Matousek’s partition trees), although some variations of the ideas 
might actually be useful. In some sense, our results can be interpreted as a theoretical explanation for why 
heuristics based on bounding boxes and BSP trees perform so well (for example, see [Andrews et al. 1994] 
on experimental results for the red/blue segment intersection problem), as many of our instance-optimal 
algorithms prune input based on bounding boxes and spatial tree structures. 




Fig. 8. Definition of ^(S): In this instance, points pi,p 2 ,P 3 are 
dominated by <71 , <72 , <73 , and so |F(pi)| = |F(p 2 )| = |F(p 3 )| = 
n = 12. Points p 4 ,p$,pe, q 2 are dominated only by 92 , and so 
|F(p4)| = |F(p 5 )| = |F(p 6 )| = |F( W )| = 7. Similarly |F(p 7 )| = 
\F(ps)\ = |F(pb)| = |F(®,)| = 4 and |F( gi )| = 7. Thus, F(S) = 

3 *°s ll + 5 lo § T + 4 lo § x- 


Note that specializations of our techniques to 1-d can also lead to order-oblivious instance-optimal results 
for the multiset-sorting problem and the problem of computing the intersection of two (unsorted) sets. 
Adaptive algorithms for similar 1-d problems (e.g., [Munro and Spira 1976]) were studied in different settings 
from ours. 

Not all standard geometric problems admit nontrivial instance-optimal results in the order-oblivious set¬ 
ting. For example, computing the Voronoi diagram of n points or the trapezoidal decomposition of n disjoint 
line segments, both having @(n) sizes, requires fl(nlogn) time for every point set by the naive information- 
theoretic argument. Computing the (Loo-)closest pair for a monochromatic point set requires fl(nlogn) time 
for every point set by our adversary lower-bound argument. 

An open problem is to strengthen our lower bound proofs to allow for a more general class of test functions 
beyond multilinear functions, for example, arbitrary fixed-degree algebraic functions. 

It remains to see how widely applicable the concept of instance optimality is. To inspire further work, 
we mention the following geometric problems for which we currently are unable to obtain instance-optimal 
results: 

(a) reporting all intersections between a set of disjoint red (nonorthogonal) line segments and a set of 
disjoint blue line segments in 2-d; 

(b) computing the L 2 - or Loo-closest pair between a set of red points and a set of blue points in 2-d; 

(c) computing the diameter or the width of a 2-d point set; 

(d) computing the lower envelope of a set of (perhaps disjoint) line segments in 2-d. 

Finally, we should mention that all our current results concern at most logarithmic-factor improvements. 
Obtaining some form of instance-optimal results for problems with w(nlogn) worst-case complexity (for 
example, off-line triangular range searching, 3SUM-hard problems, ...) would be even more fascinating. 

APPENDIX 

A. AN ALTERNATIVE PROOF FOR 2-D MAXIMA 

In this appendix, we describe an alternative approach to the 2-d maxima problem, which uses a new definition 
of a difficulty measure and a vastly different lower-bound proof based on an interesting encoding argument. 
This approach is more specialized and does not seem to work for 3-d maxima or other problems, but the 
lower-bound proof has the advantage of being generalizable to nondeterministic algorithms. 

We begin by defining a measure of difficulty F(S) specific to the 2-d maxima problem, which is seemingly 
different from the structural entropy H{S) defined previously. The new definition appears simpler in the 
sense that we do not need to take the minimum over all partitions but measure the contribution of each 
point directly, but as a byproduct of our analyses, F(S) is asymptotically equivalent to n'H(S) (which is why 
we do not give it a name). 

Definition A.l. Given a point set S , let qi,...,qh denote the maximal points of S from left to right, 
with go = (—00,00) and qh+i = (00, —00). Given a point p € 5, let g *,... ,qe be all the maximal points that 
dominate p. Define F(j>) to be the subset of all points in S in the slab ( qi-\.x , qt+i.x) x K., where we use p.x 
and p.y to denote the x- and y-coordinates of p. Define F(S) = X] P eS log(n/|F(p)|). (See Figure 8.) 

For the upper-bound side, we use the same algorithm and an analysis similar to before: 

Theorem A.2. Algorithm maxima2d(5') from Section 2.1 runs in 0(J-(S) + n) time. 

PROOF. We proceed as in the proof of Theorem 2.3, but a simpler argument replaces the second paragraph: 
Fix a point p £ S. Let qi,...,qe be all the maximal points that dominate p. Fix a level j. If | F(p)\ > [n/2- J J, 










then by (i), that some maximal point from {< 7 ,;,..., qf\ must been discovered, and by (ii), this implies that 
p does not survive level j. Thus, p can survive only for 0{\og{n/\F(jp)\) + 1) levels. We can asymptotically 
bound the running time by JN n j € 0(Y^ p {\og(n/\F(p)\) + 1)) = 0(F(S) + n). □ 

For the lower-bound side, we first consider a slightly strengthened problem which we call maxima with 
witnesses: given a point set S, report (the indices of) all maximal points in left-to-right order, and for each 
nonmaximal point p in S , report a maximal point (a witness) that dominates p. 

Theorem A.3. OPT(S'), OPT avg (S') £ tt(F(S) +n) for the 2-d “maxima with witness” problem in the 
comparison model. 

Proof. The proof is a counting argument, which we express in terms of encoding schemes (see [Demaine 
and Lopez-Ortiz 2003; Golynski 2009] for more sophisticated examples of counting arguments based on 
encoding/decoding). We will describe a way to encode an arbitrary permutation a of S, so that the length of 
the encoding can be upper-bounded in terms of the running time of the given algorithm A on input a. Since 
the worst-case encoding length must be at least log(n!), the running time must be large for some permutation 
a. (All logarithms are in base 2.) 

To describe the encoding scheme, we imagine that the permutation a is initially unknown, and as we 
proceed, we record bits of information about a so that at the end, a can be uniquely determined from these 
bits. In the description below, we distinguish between an input point, as represented its index in the input 
permutation a (its actual location is not necessarily known), and an actual point in S (its coordinates are 
known but its index in a is not necessarily known). At any moment, if we know which input point corresponds 
to an actual point p, we say (naturally) that p is known. 

We first simulate the algorithm on a and record the outcomes of the comparisons made; this requires 
Ta(ct) bits (recall that Ta(<j) denote the number of comparisons made by A on a). Let M be the list of 
maximal input points returned. For each input point qi , let W(qt) be the list of all nonmaximal input points 
that have < 7 ,; as witness. For each maximal actual point, we record its position in M, using h [log ft] bits in 
total. Now all maximal points are known. 

We process the nonmaximal actual points of S from left to right, and make them known as follows. To 
process an actual point p , let qi, ... ,qj be all the maximal points that dominate p, which are all known. 
Observe that p must be in W(qi) U • • • U W(qj). Let L be all the points that are left of p , which are all 
known. We record the position of p in the list (W (qi) U • • • U W (qj)) — L of input points (say, ordered by their 
indices). This requires [log(|(IT(< 7 ,;) U • • • U W(qj)) — L |] bits. Observe that W(qi) U • • • U W(q.j) is contained 
in (— 00 , qj.x) xl. So, (W(qi) U • • • U W(qj)) — L is contained in the subset F(p) from our Definition A.l -a 
lucky coincidence. Thus, the number of bits required is [log |F , (p)|]. Now p is known and we can continue 
the process. 

By our construction, any permutation er of S can be uniquely decoded from its encoding, for any given 
set S. The encoding has total length at most 

Ta(ct) + ft logft + ^ log |F(p)| + 0(?i) = Ta(ct) + ft log h + n log n - F(S) + 0(n). 
v 

Taking the maximum over all permutations cr of S, we thus obtain log(n!) < Ta(S) + ft. log ft + ?zlogn — 
F(S) + 0(n), yielding Ta(S) + n + ft log ft £ kl(F(S)). Combined with the trivial lower bound f l(n) and the 
naive information-theoretic lower bound Ta(S) £ fl(ftlogft) (as the problem definition requires the output 
to be in sorted order), this implies that Ta(S) £ f l(F(S) + n). 

The proof works in the random-order setting as well: In any encoding scheme, at most a fraction 2~ cn of 
the n\ permutations can have encoding length less than log(?z!) — cn for any constant c. Thus, for a random 
permutation a, with high probability the encoding length is at least log(n!) — O(n), implying Ta(ct) £ 
fl(F(S) + n). In particular, T™ & (S) £ kl(F(S) + n). □ 

Combining the above theorem with the following observation yields a complete proof of the f l(F(S) + n) 
lower bound: 

Observation A.4. Any algorithm for the 2-d maxima problem in the comparison model can be made to 
solve the 2-d “maxima with witnesses” problem without needing to make any additional comparisons. 

Proof. Consider the partial order -< x over S formed by the outcomes of the ^-comparisons made by the 
maxima algorithm A. Define the partial order -< y similarly. Fix a nonmaximal point p. We argue that there 
must be a point q £ S such that p -< x q and p -< y q. Suppose that every q £ S has p -ft x q or p q. Consider 



modifying the point set as follows: First, increase the ^-coordinates of p and all points in {q £ S : p -< x q} by 
a sufficiently large common value; this does not affect the outcomes of the comparisons made, and ensures 
that all points q with p -fix q now have p.x > q.x. Similar, increase the ^/-coordinates of p and all points in 
{q £ S : p -< y q} by a sufficiently large common value; all points q with p fi y q now have p.y > q.y. Then 
every q £ S now has p.x > q.x or p.y > q.y 1 i.e., p is now maximal, and the algorithm would be incorrect on 
the modified point set: a contradiction. 

For every nonmaximal point p , we can thus find a witness point q that dominates p, without making any 
additional comparisons. One issue remains: the witness point may not be maximal. If not, we can change p’s 
witness to the witness of the witness, and so on, until p's witness is maximal. 

Finally, note that we do not require the given algorithm A to report the maximal points in left-to-right 
order. We argue that at the end we already know the az-order of the maximal points. Suppose that q fi x q' 
for two consecutive maximal points q and q'. Consider modifying the point set as follows: Increase the x- 
coordinates of q and all points in {p £ S : q -< x p} by a sufficiently large common value; this does not 
affect the outcomes of the comparisons made, and ensures that we now have q.x > q'.x (while maintaining 
q.y > q'.y). Then q' is now nonmaximal, and the algorithm would be incorrect on the modified point set: a 
contradiction. □ 

Remark A.5. The proof can be modified for weaker versions of the problem, for example, reporting just 
the number of maximal points (or its parity). 

The proof does not appear to work for problems other than maxima in 2-d. One obvious issue is that 
Observation A.4 only applies to comparison-based algorithms for orthogonal-type problems. Even more 
critically, the proof of Theorem A.3 relies on a coincidence that is special to 2-d maxima. 

Curiously, this lower-bound proof holds even for nondeterministic algorithms, i.e., algorithms that can 
make guesses but must verify that the answer is correct; here we assume that each bit guessed costs unit 
time. In the proof of Theorem A.3, we just record the guesses in the encoding. The previous proofs of instance 
optimality by [Fagin et al. 2003] and [Demaine et al. 2000] all hold in nondeterministic settings. Perhaps this 
strength of the proof prevents its applicability to other geometric problems, whereas our adversary-based 
proofs more powerfully exploits the deterministic nature of the algorithms. 
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